Remarks

In view of the above amendments and the following remarks, reconsideration 
of the outstanding office action is respectfully requested. 

Claim 1 has been amended, claims 3-5 have been cancelled without prejudice, 
and new claims 10-21 have been added. Descriptive support for new claims 10 and 11
appears in the first full paragraph on page 30 and the third full paragraph on page 35, 
respectively; descriptive support for new claims 12-14 appears in the first full paragraph on 
page 34; and descriptive support for new claims 15 and 16 appears in the first full paragraph 
on page 30. New claim 17 finds descriptive support in original claim 3 (i.e., claim written in 
independent form), and new claims 18-21 find descriptive support in original claims 4 and 6-8,
respectively. Claims 1, 2, and 6-21 are pending.

The objection to the specification is overcome by the above amendments. 
Although applicants disagree with the assertion made by the U.S. Patent and Trademark Office
("PTO"), the present claim language is clearly supported by the first full paragraph on page
30, along with the disclosure of the nucleic acid sequence of SEQ ID NO: 177 and the 
corresponding amino acid sequence of SEQ ID NO: 178. 

The objections to claims 1 and 5 are overcome by the above amendments and 
should be withdrawn. 

The rejection of claims 1-9 under 35 U.S.C. §112 (first paragraph) as lacking 
written descriptive support is respectfully traversed. 

The burden of establishing that an application lacks adequate written 
descriptive support falls on the PTO. See In re Wertheim, 541 F.2d 257, 263, 191 USPQ 90, 
97 (CCPA 1976) ("[T]he PTO has the initial burden of presenting evidence or reasons why 
persons skilled in the art would not recognize in the disclosure a description of the invention 
defined by the claims."). Hence, the PTO must demonstrate why the disclosure is 
insufficient. 

The Federal Circuit has clearly espoused that per se conclusions of written 
description violations cannot be founded upon the basis of genus size alone. See Enzo 
Biochem, Inc. v. Gen-Probe Inc., 296 F.3d 1316, 1326-27, 63 USPQ2d 1609, 1614-15 (Fed. 
Cir. 2002) (refusing to adopt position that three species as a matter of law cannot satisfy 
written description requirement for significantly larger genus). Thus, the PTO's conclusion 
cannot be based on genus size alone. But that is precisely what the PTO has done at pages 3-4
of the outstanding office action. Because the PTO's position is unsupported by law and
unsupported by any facts other than genus size, applicants submit that the PTO's position 
cannot be sustained. 

In contrast, applicants present Exhibits 1-3 (attached hereto) as evidence that 
the nucleic acid sequence of SEQ ID NO: 177 and the corresponding amino acid sequence of 
SEQ ID NO: 178 represent the claimed genus. Exhibit 1 is a presentation of Genbank 
accessions for Bacillus or Geobacillus (formerly Bacillus) holA nucleic acids that are 
homologous to the nucleotide sequence of SEQ ID NO: 177. These holA nucleic acids were
identified by a protein-protein BLAST search of the Genbank database performed using the 
amino acid sequence of SEQ ID NO: 178 and the BLAST default settings. Homologous 
sequences were identified in Geobacillus kaustophilus, Bacillus cereus, Bacillus anthracis, 
Bacillus thuringiensis, Bacillus weihenstephanensis, Bacillus subtilis, Bacillus licheniformis, 
Bacillus halodurans, and Bacillus clausii (Exhibit 1). Based upon alignments performed 
using Align® for nucleic acids and ClustalW for amino acids (using the European Molecular 
Biology Laboratory server and its default settings), these homologs share between about 51 
and about 98 percent identity at the nucleic acid level (Exhibit 2) and between about 37 and 
about 99 percent identity at the amino acid level (Exhibit 3). Thus, species of delta subunits 
from organisms that belong to the biological classification Bacillus or Geobacillus clearly 
share similar structure and, therefore, function. 
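
For illustration only, the percent identity figures reported above can be understood as the share of aligned positions at which two sequences carry the same residue. The following is a minimal sketch under stated assumptions: the function name `percent_identity` is hypothetical, the two inputs are assumed to be already aligned (equal length, gaps written as "-"), and gapped columns are excluded from the denominator. It is not the calculation performed by the alignment tools named above, which may count gaps differently. The example residues are the first 20 amino acids of the Bacillus cereus subsp. cytotoxis and Bacillus anthracis translations shown in Exhibit 1.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: the inputs are two rows of an existing pairwise alignment,
    so they must have the same length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# First 20 residues of two Exhibit 1 translations (one mismatch, F versus L).
cytotoxis = "MSDIHKKIKKKQFAPFYLLY"
anthracis = "MSDIHKKIKKKQFAPLYLLY"
print(percent_identity(cytotoxis, anthracis))
```

Whether gapped columns count toward the denominator is a convention that changes the reported percentage, which is why the sketch states its choice explicitly.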

Applicants submit that the language recited in claims 1 and 9 is precisely the 
type of claim language that was acknowledged in Univ. of California v. Eli Lilly, 119 F.3d
1559, 43 USPQ2d 1398 (Fed. Cir. 1997) as being acceptable under the written description 
requirement. In Eli Lilly, the Federal Circuit addressed the validity of several claims of U.S. 
Patent No. 4,652,525 to Rutter et al. ("Rutter"), specifically those claims that recited the 
limitations 'vertebrate,' 'mammalian,' or 'human' cDNA for insulin. Rutter disclosed the 
nucleotide and amino acid sequences of a rat cDNA encoding insulin, but merely described a 
general procedure for obtaining the human cDNA encoding insulin. Id. at 1567, 43 USPQ2d 
at 1405. The Federal Circuit found that the description of the rat cDNA did not provide 
adequate descriptive support for the narrow subgenus of 'human' cDNA (no species 
disclosed), the larger subgenus of 'mammalian' cDNA (only the one rat species disclosed), 
and the larger genus of 'vertebrate' cDNA (only the one rat species disclosed). Id. at 1567-68,
43 USPQ2d at 1405. The Federal Circuit did acknowledge, however, the district court's
statement that the specification provided adequate written descriptive support for the
subgenus of 'rat' cDNA encoding insulin. Id. at 1566.

Thus, functional language should be acceptable when the genus as claimed is 
sufficiently limited in scope (i.e., from Bacillus or Bacillus stearothermophilus) and the 
specification describes one or more species within that genus. Claims 1 and 9 recite the same 
type of functional claim language that was identified as acceptable in Eli Lilly given the 
description of a single species by its nucleotide sequence. Thus, it should be evident that 
claims 1 and 9 (and claims dependent thereon) find written descriptive support in the present 
application. 

It should be noted that the "Guidelines for Examination of Patent Applications 
Under the 35 U.S.C. 112, ¶ 1, 'Written Description' Requirement," make explicitly clear that
the description of a representative number of species does not require the description to be of 
such a nature that it would provide support for each species that the genus embraces. 66 Fed. 
Reg. 1099, 1106 (2001). Hence, the absence of sequences (in the present specification) for
the later-identified holA and delta homologs is irrelevant to the issue of whether the present 
specification provides adequate written descriptive support for their use in accordance with 
the present invention. 

Moreover, the conclusion by the PTO is contrary to evidence submitted 
herewith by applicants. As demonstrated by Exhibits 1-3, one of ordinary skill in the art 
would have understood that applicants were in possession of the presently claimed invention 
at the time the present application was filed. This is so, because persons of skill in the art 
would have expected sufficiently related organisms from the genus Bacillus (and now 
Geobacillus) to possess homologous holA nucleotide sequences or delta subunit proteins.
Exhibits 1-3 confirm this expectation to have been reasonable. 

In view of all of the foregoing, applicants submit that the rejection of claims 1-9
is improper and should be withdrawn.

The rejection of claims 1-9 under 35 U.S.C. §112 (first paragraph) for lack of 
enablement is respectfully traversed. 

It is the position of the PTO that the specification does not provide sufficient 
guidance for making and using other delta subunit-encoding DNA molecules within the scope 
of the claims. Applicants respectfully disagree. 

The PTO is respectfully reminded that all that is needed is objective 
enablement of what is claimed. In re Wright, 999 F.2d 1557, 1561, 27 USPQ2d 1510, 1513 
(Fed. Cir. 1993). The present application provides the nucleotide sequence of Bacillus (now 
Geobacillus) stearothermophilus holA (e.g., SEQ ID NO: 177) and describes how one of
ordinary skill can isolate homologs of the disclosed sequence (see page 41, line 9 to page 42,
line 29), express the delta subunit encoded by such homologous holA sequences (see
Example 19, expressing A. aeolicus delta subunit), and test the encoded delta subunit for
clamp loader assembly competence (see Examples 24 and 25, testing A. aeolicus clamp
loader assembly) and for clamp loader activity (see Examples 26 and 30, testing A. aeolicus
clamp loader activity). Thus, one of ordinary skill in the art would have been fully able to
make and use DNA molecules and their encoded proteins within the scope of the presently 
claimed invention. 

Moreover, with regard to method 3 for homolog identification, described at
page 42, that is precisely the approach used to identify the holA homologs shown in Exhibit 1 
(i.e., from other Bacillus or Geobacillus organisms). For this reason, it should be apparent 
that the present application fully enables the production and use of other species of Bacillus 
or Bacillus (now Geobacillus) stearothermophilus holA homologs. 

In view of all of the foregoing, applicants submit that the rejection of claims 1-9
for lack of enablement is improper and should be withdrawn.

Because claim 1 is allowable for the reasons noted above, applicants further submit
that new claims 10-16 also are allowable. Consistent with the PTO acknowledgments at
pages 3-4 of the outstanding office action, applicants further submit that the specification
provides written descriptive support for and enables the claimed DNA molecules that encode 
the delta subunit including the amino acid sequence of SEQ ID NO: 178 (i.e., claims 17-21). 

In view of all of the foregoing, applicants submit that this case is in condition
for allowance and such allowance is earnestly solicited. 

Respectfully submitted, 

Date: August 24, 2006 /Edwin V. Merkel/ 

Edwin V. Merkel 
Registration No. 40,087 

NIXON PEABODY LLP 
Clinton Square, P.O. Box 31051 
Rochester, New York 14603-1051 
Telephone: (585)263-1128 
Facsimile: (585)263-1600 
LOCUS       BA000043                1047 bp    DNA     linear   BCT 04-DEC-2004
DEFINITION  Geobacillus kaustophilus HTA426 DNA, complete genome.
ACCESSION   BA000043 REGION: 2532557..2533603
VERSION     BA000043.1  GI:56378377
SOURCE      Geobacillus kaustophilus HTA426
  ORGANISM  Geobacillus kaustophilus HTA426
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Geobacillus.

REFERENCE   1
  AUTHORS   Takami,H., Takaki,Y., Chee,G.J., Nishi,S., Shimamura,S., Suzuki,H.,
            Matsui,S. and Uchiyama,I.
  TITLE     Thermoadaptation trait revealed by the genome sequence of
            thermophilic Geobacillus kaustophilus
  JOURNAL   (er) Nucleic Acids Res. 32 (21), 6292-6303 (2004)
   PUBMED   15576355
REFERENCE   2  (bases 1 to 1047)
  AUTHORS   Takami,H., Takaki,Y. and Chee,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2003) Hideto Takami, Japan Marine Science and
            Technology Center, Microbial Genome Analysis Research Group; 2-15
            Natsushima-cho, Yokosuka, Kanagawa 237-0061, Japan
            (E-mail:takamih@jamstec.go.jp,
            URL:http://www.jamstec.go.jp/jamstec-e/bio/exbase.html,
            Tel:81-46-867-9643, Fax:81-46-867-9645)
FEATURES             Location/Qualifiers
     source          1..1047
                     /organism="Geobacillus kaustophilus HTA426"
                     /mol_type="genomic DNA"
                     /strain="HTA426"
                     /isolation_source="isolated from the deepest Ocean"
                     /db_xref="taxon:235909"
                     /note="thermophile"
     gene            complement(1..1047)
                     /locus_tag="GK2513"
     CDS             complement(1..1047)
                     /locus_tag="GK2513"
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical conserved protein"
                     /protein_id="BAD76798.1"
                     /db_xref="GI:56380890"
                     /translation="MLERVMGNIEKRRFSLLYLLYGNEPFLLTETYERLVNAALGPEE
                     REWNLAVYDCEETPVEAALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLE
                     AYLKAPSPFSIVVFFAPYEKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIES
                     QGAQASDEAIDVLLRRAGTQLSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENV
                     FVLVEQVAKRDIPAALQTFYDLLENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQ
                     IAAALKVHPFRVKLALAQAARFADGELAEAINELADADYEVKSGAVDRRLAVELLLMR
                     WGTRPAQAGRHGRR"

ORIGIN 

1 tcaccgccgg ccgtgacgcc ccgcttgcgc cgggcgggtg ccccaccgca tcagaagcag 

61 ctcaacggcc aaccggcgat cgaccgctcc gcttttcact tcgtaatcgg cgtcagcgag 

121 ctcgttgatc gcctcagcaa gctctccgtc agcgaagcgg gccgcttgag caagagcgag 

181 cttgacgcgg aacgggtgca ccttgagcgc cgcagcaatt tgcgcctgcc cgtagcctaa 

241 ggaggcaagc catttcactt gcgaaagcaa gcggaaatgg gcggcgagca gcgccaaaat 

301 tttgatcggc tcttcattgt tttcaagcag atcataaaac gtctgcaacg ccgctggaat 

361 gtcgcgcttc gccacttgct cgacaagcac aaatacgttt tcttccggcg tgcgggcgac 

421 aagccgttca accgccgccg cctcgatggt tccgcccgat ccggcaaaca gggccaattt 

481 atcgatttca ttcgccaagg cggaaagctg cgtcccggcc cgccgcaaca ggacatcaat 

541 cgcctcgtcg cttgcttgcg ccccttggct ctcgatgcgg cgccgcaccc aggcgcgcag 

601 ctccgcttcg gcgagcgggg cggcgatgac gacttcgctt tgctctttgg cgagcttcgt 

661 aatttttttt cgctcatcaa gcttctcgta cggcgcgaaa aagacgacga tcgaaaacgg 

721 cgacggcgcc ttcaagtacg cctccagctt cgccaaatca tgttcgatct ccttctcttt 

781 ttcagacgtg aaaaaatatg gatgcttgat gaggatgaca cgccgctcgc cgaaaaacgg 

841 caccgtctcg gcctcctcaa gcgccgcctc gaccggcgtt tcctcgcagt cgtacacagc 

901 caagttccac tcccgctcct cggggccaag cgctgcgttc accaatcgct catacgtttc 

961 cgttaataaa aacggctcat tgccgtataa taaataaaga agagaaaaac gccgtttttc 
1021 aatgtttccc catacgcgtt ccagcat 

// 



LOCUS       NZ_AAOX01000056         1041 bp    DNA     linear   BCT 01-MAR-2006
DEFINITION  Bacillus sp. NRRL B-14911 1099999053110, whole genome shotgun
            sequence.
ACCESSION   NZ_AAOX01000056 REGION: complement(10519..11559)
VERSION     NZ_AAOX01000056.1  GI:89101064
KEYWORDS    WGS.
SOURCE      Bacillus sp. NRRL B-14911
  ORGANISM  Bacillus sp. NRRL B-14911
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1  (bases 1 to 1041)
  AUTHORS   Siefert,J., Ferriera,S., Johnson,J., Kravitz,S., Halpern,A.,
            Remington,K., Beeson,K., Tran,B., Rogers,Y.-H., Friedman,R. and
            Venter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2006) J Craig Venter Institute, 9704 Medical
            Center Drive, Rockville, MD 20850, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genomes Automatic
            Annotation Pipeline Group. Information about the Pipeline can be
            found here:
            http://www.ncbi.nlm.nih.gov/genomes/st Please be
            aware that the annotation is done automatically with little or no
            manual curation.

FEATURES             Location/Qualifiers
     source          1..1041
                     /organism="Bacillus sp. NRRL B-14911"
                     /mol_type="genomic DNA"
                     /strain="NRRL B-14911"
                     /db_xref="taxon:313627"
     gene            1..1041
                     /locus_tag="B14911_06638"
     CDS             1..1041
                     /locus_tag="B14911_06638"
                     /note="COG1466 DNA polymerase III, delta subunit"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA polymerase III subunit delta"
                     /protein_id="ZP_01173914.1"
                     /db_xref="GI:89101075"

/translation="MVLDVWKQINNKQFTPIYLLYGTEGFLINETKQLLINNVLAEDE 
SDFNLSTYDLEETPIEAALEDAETFPFMGDKRLIFLHNPSFLTSEKTKGKTEHNLAKL 
ESYLKEPSPYSIIVFAAPYEKLDDRKKITKELKKKASVLEAKKLGEQELKVWIRERAA 
QNGVQIEEDAVQLMLTLAGTNLFMLTGEIDKLALYADEEKLIDAEAVDKLVARSLEQN
IFTLVDKVVHRKIDEALRIYYDLLKQNEEPIKILSILTGQFRLIYQVKELSRRGYGQQ
QTAGYLKVHPFRVKLAAGQAQLFGDEELSRIMKLLAEADYEMKTGGMNKEMLIEMFLF 

RLHDHALHKQTK" 

ORIGIN
1 atggttttag atgtgtggaa gcaaataaat aacaaacaat ttaccccgat atatttatta
61 tacggaacag aaggcttttt gattaatgaa acaaaacagc tgcttatcaa caatgtgctg
121 gctgaagacg agagcgattt taatctgtct acatatgatt tggaggaaac accaatagaa 
181 gcggcacttg aggatgcaga gaccttccct ttcatggggg ataagaggct catctttctt 
241 cacaacccct cttttctgac ttccgaaaag acaaagggga agactgagca taatctggcc 
301 aagctggaat cctatttaaa agagccgtcg ccatattcaa ttattgtgtt tgcggcaccc 
361 tatgaaaagc tggatgatag aaaaaagatt acgaaggaac tgaagaaaaa agcaagtgtc 
421 cttgaggcca aaaaactggg ggagcaggag ctgaaagtct ggatcaggga aagggctgcc 
481 cagaacggcg tccagattga ggaggatgcc gtgcagctta tgctgactct ggcaggtacc 
541 aacctcttta tgctgacagg tgaaattgat aaattggcct tatatgccga cgaggaaaag 
601 ctgattgatg ccgaggcagt tgacaagctg gttgccaggt cgcttgagca gaatattttc
661 acccttgtcg acaaggtggt ccacagaaaa atagatgagg cactgcggat ttattatgat 
721 cttttaaagc aaaatgaaga gccgattaaa attctttcca ttctgaccgg ccaattcagg 
781 ctgatctacc aggtgaagga gctttcgaga aggggatacg gccagcagca gacggccgga 
841 tacttaaagg tccatccctt cagggtcaag cttgcagccg gacaggcoca gcttttcgga 
901 gatgaagaac tgtcccgcat catgaagctt ctggcagaag cagattatga gatgaaaaca 
961 ggggggatga acaaggaaat gcttattgag atgtttttgt tcaggctcca cgaccatgcc 
1021 ctccacaaac agacaaaata a 

// 
LOCUS       NZ_AALL01000072         1011 bp    DNA     linear   BCT 06-MAR-2006
DEFINITION  Bacillus cereus subsp. cytotoxis NVH 391-98 ctg92, whole genome
            shotgun sequence.
ACCESSION   NZ_AALL01000072 REGION: complement(4223..5233)
VERSION     NZ_AALL01000072.1  GI:89203481
KEYWORDS    WGS.
SOURCE      Bacillus cereus subsp. cytotoxis NVH 391-98
  ORGANISM  Bacillus cereus subsp. cytotoxis NVH 391-98
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus
            cereus group.

REFERENCE   1  (bases 1 to 1011)
  AUTHORS   Lapidus,A., Goltsman,E., Copeland,A., Lucas,S., Barry,K.,
            Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E.,
            Tice,H., Bruce,D., Pitluck,S., Richardson,P., Auger,S.,
            Galleron,N., Sanchis,V., Gohar,M., Broussole,V., Brillard,J.,
            Guinebretiere,M.-H., Ehrlich,S.D., Lereclus,D., Aymerich,S.,
            Nguen-the,C. and Sorokin,A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     Sequencing of the draft genome and assembly of Bacillus cereus
            subsp. cytotoxis NVH 391-98
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1011)
  AUTHORS   Larimer,F. and Land,M.
  CONSRTM   US DOE Joint Genome Institute (JGI-ORNL)
  TITLE     Annotation of the draft genome assembly of Bacillus cereus subsp.
            cytotoxis NVH 391-98
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 1011)
  AUTHORS   Lapidus,A., Goltsman,E., Copeland,A., Lucas,S., Barry,K.,
            Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E.,
            Tice,H., Bruce,D., Pitluck,S., Richardson,P., Auger,S.,
            Galleron,N., Sanchis,V., Gohar,M., Broussole,V., Brillard,J.,
            Guinebretiere,M.-H., Ehrlich,S.D., Lereclus,D., Aymerich,S.,
            Nguen-the,C. and Sorokin,A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAR-2006) US DOE Joint Genome Institute, 2800
            Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA
COMMENT     URL -- http://www.jgi.doe.gov
            Contact: Paul Richardson (microbes@cuba.jgi-psf.org)
            Draft sequencing done at US DOE Joint Genome Institute
            Source DNA and bacteria available from Alexei Sorokin
            (alexei.sorokine@jouy.inra.fr)

            The JGI and collaborators endorse the principles for the
            distribution and use of large scale sequencing data adopted by the
            larger genome sequencing community and urge users of this data to
            follow them. It is our intention to publish the work of this
            project in a timely fashion and we welcome collaborative
            interaction on the project and analysis.
            (http://www.genome.gov/page.cfm?pageID=10506376)
            Notes:
            Bacillus cereus NVH 398-91 was originally isolated as a contaminant
            of vegetable puree in a retirement house in Essone (France) by
            Marie-Laure De Buyser (AFSSA, France). The strain was identified as
            a cause of a severe food poisoning case. It was further
            characterized biochemically and a novel cytotoxin K was identified
            as a major cause of epithelial cell lysis leading to necrotic
            enteritis (Lund et al, Mol Microbiol, 2000, 38, 254). Although the
            strain was identified as Bacillus cereus, it appeared to be
            different from all known strains. Detailed phylogenetic studies
            using MLST approach placed the strain rather far from main strain
            clusters, but confirmed that the strain belongs to the B. cereus
            group (Sorokin et al, Appl Env Microbiol, 2006, 72, 1569). Genomic
            sequence clearly indicated that the strain is very different from
            other representatives of the group and should be assigned a new
            species status, for which we propose a name Bacillus cytotoxis.
            However, assigning of the novel species status is hampered by the
            fact that only one such strain is described at the moment of
            genomic sequencing being done (2005).
FEATURES             Location/Qualifiers
     source          1..1011
                     /organism="Bacillus cereus subsp. cytotoxis NVH 391-98"
                     /mol_type="genomic DNA"
                     /strain="NVH 391-98"
                     /db_xref="taxon:315749"
     gene            1..1011
                     /locus_tag="Bcer98DRAFT_3801"
     CDS             1..1011
                     /locus_tag="Bcer98DRAFT_3801"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA polymerase III, delta subunit"
                     /protein_id="ZP_01182145.1"
                     /db_xref="GI:89203487"
                     /db_xref="InterPro:IPR005790"
                     /db_xref="InterPro:IPR010372"

                     /translation="MSDIHKKIKKKQFAPFYLLYGTEAYFINETIDLITVEALAEEDR
                     DFNIVTYDLEEAYLEDVVEDARTLPFFGERKILLIKSPLFLTSQKEKLEQNIKILEEY
                     IAEPSPFSIMIFIAPYEKLDERKKITKLLKKTAEVVEANAMQVQDVRKWIVKRAEEVH
                     VHVEEAAVSLLLELVGSNVTMLAKEMDKLTLYVGTGGDITTKLVSELVPKSVEQNVFA
                     LTEKVVKKDIAGAMKILDGLFTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIA
                     SHIGVHPYRVKLAMNQTKLFSFEELKKVIFELAEADYSMKTGKMDKKLVLEFFLMRLN
                     HM"

ORIGIN 

1 atgagtgata tacataagaa aattaaaaag aaacagtttg ctccgtttta tttattgtat 
61 ggaacggaag cctattttat aaatgaaaca atcgatttaa ttacagtaga ggcacttgcc 
121 gaagaagatc gtgattttaa tattgttaca tatgatttag aagaggcgta tttagaggat 
181 gtagtagagg atgcaaggac gcttccattc tttggagagc gaaaaatatt attaataaag 
241 tccccgttat ttttaacatc acagaaagaa aaattggagc aaaatataaa aatccttgag 
301 gaatatattg ctgaaccatc tccattttcg atcatgattt ttatcgctcc atatgaaaag 
361 ttagatgagc gaaaaaaaat tacgaagtta ttaaagaaaa cggctgaggt agtagaagcg 
421 aatgctatgc aagtacaaga tgttaggaaa tggattgtaa agcgtgctga agaagttcat 
481 gtgcatgttg aagaggcggc tgtaagtttg cttttggagc tcgtagggag caatgtaaca 
541 atgttagcaa aggaaatgga taagttaaca ctgtatgttg gaacgggcgg agacattaca 
601 acaaaacttg taagtgaact tgtaccgaag tcagttgagc aaaatgtttt tgccttaaca 
661 gaaaaagtcg taaaaaaaga tatagcagga gcaatgaaaa tattagatgg cttstttaca 
721 caacaggaag aaccaattaa attattagcg ttgctagtga gtcaattccg tttattacat 
781 caagtaaaag aactacagca gcgaggatac ggacaaaatc aaattgcttc gcatattggt 
841 gttcatccgt accgagtaaa attggcgatg aatcaaacga aacttttctc atttgaagaa 
901 ttgaaaaagg taattttcga gttagcggaa gcggactata gtatgaaaac agggaaaatg 
LOCUS       AE017334                1011 bp    DNA     linear   BCT 09-JUL-2004
DEFINITION  Bacillus anthracis str. 'Ames Ancestor', complete genome.
ACCESSION   AE017334 REGION: 4137069..4138079
VERSION     AE017334.2  GI:50082967
KEYWORDS    .
SOURCE      Bacillus anthracis str. 'Ames Ancestor'
  ORGANISM  Bacillus anthracis str. 'Ames Ancestor'
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus
            cereus group.

REFERENCE   1  (bases 1 to 1011)
  AUTHORS   Ravel,J., Rasko,D.A., Shumway,M.F., Jiang,L., Cer,R.Z.,
            Federova,N.B., Wilson,M., Stanley,S., Decker,S., Read,T.D.,
            Salzberg,S. and Fraser,C.M.
  TITLE     Bacillus anthracis comparative genomics
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1011)
  AUTHORS   Ravel,J., Rasko,D.A., Shumway,M.F., Jiang,L., Cer,R.Z.,
            Federova,N.B., Salzberg,S. and Fraser,C.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2004) The Institute for Genomic Research, 9712
            Medical Center Dr, Rockville, MD 20850, USA
REFERENCE   3  (bases 1 to 1011)
  AUTHORS   Ravel,J., Rasko,D.A., Shumway,M.F., Jiang,L., Cer,R.Z.,
            Federova,N.B., Salzberg,S. and Fraser,C.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUL-2004) The Institute for Genomic Research, 9712
            Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Sequence update by submitter
COMMENT     On Jul 9, 2004 this sequence version replaced gi:47500402.
FEATURES             Location/Qualifiers
     source          1..1011
                     /organism="Bacillus anthracis str. 'Ames Ancestor'"
                     /mol_type="genomic DNA"
                     /strain="Ames Ancestor; A2084"
                     /db_xref="taxon:261594"
                     /note="MLVA types GT62 and A3.b"
     gene            complement(1..1011)
                     /locus_tag="GBAA4548"
     CDS             complement(1..1011)
                     /locus_tag="GBAA4548"
                     /note="similar to GB:X59739, GB:X59738, GB:X59740,
                     SP:P08048, SP:P17010, PID:340434, PID:38020, PID:38022,
                     PID:38024, GB:X59739, GB:X59738, GB:X59740, SP:P08048,
                     SP:P17010, PID:340434, PID:38020, PID:38022, and
                     PID:38024; identified by sequence similarity; putative"
                     /codon_start=1
                     /transl_table=11
                     /product="conserved hypothetical protein"
                     /protein_id="AAT33669.1"
                     /db_xref="GI:47504993"

                     /translation="MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDR
                     EFNVVTYDLEEAYLEDVVEDARTLPFFGERKVLLIKSPLFLTSQKEKLEQNIKILEEY
                     IGEPSPFSILVFVAPYEKLDERKKITKLLKKTADIVEANAMQVQDVQKWIVARAEEGH
                     VHIDNAAVSLLLELVGSNVTMLAKEMDKLTLYVGMGGEITPKLVAELVPKSVEQNVFA
                     LTEKVVKKDIAGAMQILDGLFTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIA
                     SHIGVHPYRVKLAMNQTKFFSFEELKKVIIELAEADYSMKTGKMDKKLVLEFFLMRLN
                     HM"

ORIGIN

1 tcacatatga tttaatcgca ttaagaaaaa ctcaagcaca agtttcttat ccatttttcc

61 agtcttcata ctataatcag cttccgccaa ttctataatc acttttttta gttcttcaaa

121 agagaaaaac ttcgtttgat tcatcgctaa ctttacacga tatggatgta caccaatatg 

181 agaagcaatt tgattttgtc cataaccacg ctgctgtaac tcttttactt gatgcagcaa 

241 gcggaattga cttactaaca acgcaagcaa tttaatcggt tcctcctgct gcgtaaataa 

301 tccatccaaa atttgcatcg cacctgcgat atcttttttt accacttttt ctgttaaagc 

361 aaatacattt tgttcaactg attttggcac aagctctgca acaagtttcg gcgtaatctc 

421 tccgcccata ccgacgtata acgttaactt gtccatttcc ttcgccaaca tcgtcacatt 

481 acttcccaca agctctaaca ataaactaac agctgcatta tcaatatgta catgcccttc 

541 ctctgcacga gctacaatcc acttttgaac atcctgcact tgcatcgcat tcgcttctac 

601 tatatctgct gttttcttta atagttttgt aatttttttt cgttcatcaa gcttttcgta 

661 aggcgcaaca aaaacaagaa tagaaaatgg agaaggttcc ccgatatatt cttctaaaat 

721 ttttatattt tgttctaact tttctttttg tgacgttaaa aataatggtg attttattaa 

781 taacacttta cgttctccga aaaaaggaag tgtacgtgca tcctcaacca catcttctaa 

841 atacgcttct tctaaatcat atgtcacaac attaaactcg cgatcttcct cttcaagcgc 

901 ttctgttgta ataagcttta tcgtttcatt tataaaaaac gcttccgttc catacagtaa 

961 atacaacgga gcaaactgct tttttttaat ctttttatgt atatcactca t 
LOCUS       NC_003909               1011 bp    DNA     linear   BCT 17-JAN-2006
DEFINITION  Bacillus cereus ATCC 10987, complete genome.
ACCESSION   NC_003909 REGION: 4085571..4086581
VERSION     NC_003909.8  GI:42779081
PROJECT     GenomeProject:74
KEYWORDS    .
SOURCE      Bacillus cereus ATCC 10987
  ORGANISM  Bacillus cereus ATCC 10987
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus
            cereus group.

REFERENCE   1  (bases 1 to 1011)
  AUTHORS   Rasko,D.A., Ravel,J., Okstad,O.A., Helgason,E., Cer,R.Z., Jiang,L.,
            Shores,K.A., Fouts,D.E., Tourasse,N.J., Angiuoli,S.V., Kolonay,J.,
            Nelson,W.C., Kolsto,A.-B., Fraser,C.M. and Read,T.D.
  TITLE     The genome sequence of Bacillus cereus ATCC 10987 reveals metabolic
            adaptations and a large plasmid related to Bacillus anthracis pXO1
  JOURNAL   Nucleic Acids Res. 32 (3), 977-988 (2004)
   PUBMED   14960714
REFERENCE   2  (bases 1 to 1011)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (10-SEP-2004) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 1011)
  AUTHORS   Rasko,D.A., Ravel,J., Okstad,O.A., Helgason,E., Cer,R.Z., Jiang,L.,
            Shores,K.A., Fouts,D.E., Tourasse,N.J., Angiuoli,S.V., Kolonay,J.,
            Nelson,W.C., Kolsto,A.-B., Fraser,C.M. and Read,T.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-FEB-2004) The Institute for Genomic Research, 9712
            Medical Center Dr, Rockville, MD 20850, USA
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from AE017194.
            On Feb 24, 2004 this sequence version replaced gi:29653128.
            COMPLETENESS: full length.

FEATURES             Location/Qualifiers
     source          1..1011
                     /organism="Bacillus cereus ATCC 10987"
                     /mol_type="genomic DNA"
                     /strain="ATCC 10987"
                     /db_xref="ATCC:10987"
                     /db_xref="taxon:222523"
     gene            complement(1..1011)
                     /locus_tag="BCE4404"
                     /db_xref="GeneID:2750913"
     CDS             complement(1..1011)
                     /locus_tag="BCE4404"
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="NP_980697.1"
                     /db_xref="GI:42783450"
                     /db_xref="GeneID:2750913"

                     /translation="MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDR
                     EFNVVTYDLEEAYLEDVVEDARTLPFFGERKVLLIKSPLFLTSQKEKLEQNIKILEEY
                     IGEPSPFSILVFVAPYEKLDERKKITKLLKKTADVVEANAMQVQDVQKWIVARADEVH
                     VHIDHAAVSLLLELVGSNVTMLAKEMDKLTLYVGMGGDITPKLVAELVPKSVEQNVFA
                     LTEKVVKKDIAGAMQILDGLFTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIA
                     SHIGVHPYRVKLAMNQTKFFSFEELKKVIMELAEADYSMKTGKMDKKLVLEFFLMRLN
                     HM"

ORIGIN 

1 tcacatatga tttaaccgca ttaagaaaaa ctcgagcaca agtttcttat ccatttttcc 
61 cgttttcata ctataatcag cttccgctaa ttccattatc acttttttta attcttcaaa 
121 agagaaaaac ttcgtttgat tcatcgctaa ttttacacga tatggatgca caccaatatg 
181 agacgcgatt tgattttgac cgtaaccacg ctgctgtaac tctttcactt gatgcagtaa 
241 acggaattga cttsctaata acgcaagcaa tttaataggt tcctcctgct gcgtaaataa 
301 tccatccaaa atttgcatcg cacccgcgat atcttttttt accacttttt ctgtcaaagc 
361 aaacacattt tgttcaactg attttggcac aagctctgca acgagtttcg gtgtaatatc 
421 tccgcccata ccgacatata acgttaactt gtccatttcc ttcgccaaca tcgttacatt 
481 acttcccaca agctctaaca acaaactaac agccgcatga tcaatatgca catgcacttc 
541 atctgcccga gcaacaatcc acttctgaac atcctgcact tgcattgcat tcgcttctac 
601 tacatccgct gttttcttta atagttttgt aatttttttt cgttcatcca gcttttcgta 
661 aggcgcaaca aaaacaagaa tagaaaaggg agaaggttcc ccaatatatt cttctaaaat 
721 ttttatattt tgttctaact tttctttttg tgaagttaaa aagagtggtg attttattaa 
781 taacacttta cgttctccga aaaaaggaag tgtacgtgca tcctcaacca catcttctaa 
841 atatgcttct tctsaatcat atgtcacaac attaaactcg cgatcttcct cttcaagogc 
901 ttctgttgta ataagcttta tcgtttcatt tataaaaaac gcttctgttc catatagtaa 
961 atataacgga gcaaactgct tctttttaat ctttttatgt atatcactca t 
LOCUS       NZ_AAJM01000393         1011 bp    DNA     linear   BCT 16-SEP-2005
DEFINITION  Bacillus thuringiensis serovar israelensis ATCC 35646 sql546, whole
            genome shotgun sequence.
ACCESSION   NZ_AAJM01000393 REGION: complement(97..1107)
VERSION     NZ_AAJM01000393.1  GI:75763390
KEYWORDS    WGS.
SOURCE      Bacillus thuringiensis serovar israelensis ATCC 35646
  ORGANISM  Bacillus thuringiensis serovar israelensis ATCC 35646
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus
            cereus group.

REFERENCE   1  (bases 1 to 1011)
  AUTHORS   Anderson,I., Sorokin,A., Kapatral,V., Reznik,G., Bhattacharya,A.,
            Mikhailova,N., Burd,H., Joukov,V., Kaznadzey,D., Walunas,T.,
            D'Souza,M., Larsen,N., Pusch,G., Liolios,K., Grechkin,Y.,
            Lapidus,A., Goltsman,E., Chu,L., Fonstein,M., Ehrlich,D.,
            Overbeek,R., Kyrpides,N. and Ivanova,N.
  TITLE     Comparative genome analysis of Bacillus cereus group genomes with
            Bacillus subtilis
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1011)
  AUTHORS   Anderson,I., Sorokin,A., Kapatral,V., Reznik,G., Bhattacharya,A.,
            Mikhailova,N., Burd,H., Joukov,V., Kaznadzey,D., Walunas,T.,
            D'Souza,M., Larsen,N., Pusch,G., Liolios,K., Grechkin,Y.,
            Lapidus,A., Goltsman,E., Chu,L., Fonstein,M., Ehrlich,D.,
            Overbeek,R., Kyrpides,N. and Ivanova,N.
  TITLE     Comparative genome analysis of Bacillus cereus group genomes with
            Bacillus subtilis
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 1011)
  AUTHORS   Anderson,I., Sorokin,A., Kapatral,V., Reznik,G., Bhattacharya,A.,
            Mikhailova,N., Burd,H., Joukov,V., Kaznadzey,D., Walunas,T.,
            D'Souza,M., Larsen,N., Pusch,G., Liolios,K., Grechkin,Y.,
            Lapidus,A., Goltsman,E., Chu,L., Fonstein,M., Ehrlich,D.,
            Overbeek,R., Kyrpides,N. and Ivanova,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-SEP-2005) Integrated Genomics, Inc., 2201 West
            Campbell Park Dr, Chicago, IL 60612, USA
REFERENCE   4  (bases 1 to 1011)
  AUTHORS   Anderson,I., Sorokin,A., Kapatral,V., Reznik,G., Bhattacharya,A.,
            Mikhailova,N., Burd,H., Joukov,V., Kaznadzey,D., Walunas,T.,
            D'Souza,M., Larsen,N., Pusch,G., Liolios,K., Grechkin,Y.,
            Lapidus,A., Goltsman,E., Chu,L., Fonstein,M., Ehrlich,D.,
            Overbeek,R., Kyrpides,N. and Ivanova,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-SEP-2005) Integrated Genomics, Inc., 2201 West
            Campbell Park Dr, Chicago, IL 60612, USA
Location/Qualifiers 
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ORIGIN 



1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 



atgagtgata 
gggacggaag 
gaagaagatc 
gtggttgagg 
tcaccgctat 
gaatacattg 
ct tgatgagc 
aatgcgatgc 
gtgcacattg 
atgttggcga 
acgaaacttg 
gaaaaagtgg 
cagcaggaag 
caagtgaaag 
gtacatccgt 
ttaaaaaaag 
gat aagaaac 



1. .1-011 

/organism="Bacillus thuringlensis serovar israelensis ATCC 
35646" 

/mol_t ype*= "genomic DMA" 
/strain^" ATCC 3564 6" 
/serovar— "israelensis" 
/db_xref-"ATCC: 35646" 
/db_xref="taxon: 339854 " 
1 . .1011 

/locus_tag="RBTH_02545" 
1..1011 

/locus_tag-"RBTHJD2545" * ' 

/EC_number^" 2 .7.7,7 " 
/codon_start~l 
/transl_t able=ll 

/product="DNA polymerase III, delta subunit" 
/protein_id«" ZF_0074 3124 .1 " 
/db_xref-"Gl : 75763391" 

/translation="MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDR 

EFN VVT YDLEEA YLE D WE D ARTL PF FGERKVLLX KS P L FLTSQKEKLEQNI KI LEEY 
IGEPSPFSILVFVAPYEKLDERKKITKLLKKTADWEANAMQVQDVQKWIVARADEVH 
VHIDNAAVSLLLELVGSNVTMLAKEHDKLTLYVGMGGEITTKLVAELVPKSVEQNVFA 
LTEKVVKKDIAGAMQILDGLFTQQEEPIKLLALLVSQFRLLYQVKELQQRGYGQNQIA 
SHI G VH P YRVKLAMNQ T K FFS FEE LKKVI I ELAEAD YSMKTG KMDKKL VLE FFLMRLN 
HM" 

tacataaaaa 
ctttttttat 
gagagtttaa 
atgcgcgtac 
ttttgacttc 
gtgaaccttc 
gaaaaaaaat 
aagtgcagga 
ataacgcggc 
aggaaatgga 
t tgcagagct 
tgaaaaaaga 
aaccgattaa 
agttacagca 
atcgtgtaaa 
tgattataga 
ttgtgctcga 



aatt aaaaag 
aaatgaaacg 
tgttgtgaca 
gcttcctttc 
gcaaaaagaa 
tccattttct 
tacaaaacta 
tgttcagaag 
tgttagtttg 
caagttaacg 
tgtgccaaaa 
tattgcaggt 
attgcttgca 
acgtggttat 
attggcaatg 
attagcggaa 
gtttttctta 



aaacagtttg 

ataaagctta 

tacgatttgg 

tttggagagc 

aagttagagc 

attcttgttt 

ttaaagaaaa 

tggattgttg 

ttgctagagc 

ttatacgttg 

tcagttgagc 

gcgatgcaaa 

ttattagtga 

gggcaaaatc 

aatcaaacoa 

get gat tat a 

atgcgattaa 



etcegttata 
ttacgacaga 
aagaagcgta 
ggaaagtact 
aaaatataaa 
ttgttgcgcc 
cagcagat gt 
ctegggcaga 
tggtgggaag 
gtatgggcgg 
aaaatgtatt 
ttttggatgg 
gtcaattccg 
aaattgeate 
aatttttctc 
gtatgaagac 
atcatatgtg 



tttactatat 
agcacttgaa 
tttagaagat 
gttaataaaa 
aattttagaa 
ttacgaaaaa 
agtagaagcg 
tgaagtgcat 
taatgtaacg 
agagattaca 
tgctttgaca 
at tatttacg 
tttgttgtat 
acatattggt 
ttttgaagaa 
gggaaagatg 
a 






1: NZ_AAOY01000019. Reports Bacillus weihenst...[gi:89206281]

LOCUS       NZ_AAOY01000019 1011 bp DNA linear BCT 06-MAR-2006
DEFINITION  Bacillus weihenstephanensis KBAB4 ctg244, whole genome shotgun
            sequence.
ACCESSION   NZ_AAOY01000019 REGION: complement(8181..9191)
VERSION     NZ_AAOY01000019.1 GI:89206281
KEYWORDS    WGS.
SOURCE      Bacillus weihenstephanensis KBAB4
  ORGANISM  Bacillus weihenstephanensis KBAB4
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus
            cereus group.
REFERENCE   1 (bases 1 to 1011)
  AUTHORS   Lapidus,A., Goltsman,E., Copeland,A., Lucas,S., Barry,K.,
            Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E.,
            Tice,H., Bruce,D., Pitluck,S., Richardson,P., Auger,S.,
            Galleron,N., Sanchis,V., Gohar,M., Broussole,V., Brillard,J.,
            Guinebretiere,M.-H., Ehrlich,S.D., Lereclus,D., Aymerich,S.,
            Nguen-the,C. and Sorokin,A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     Sequencing of the draft genome and assembly of Bacillus
            weihenstephanensis KBAB4
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 1011)
  AUTHORS   Larimer,F. and Land,M.
  CONSRTM   US DOE Joint Genome Institute (JGI-ORNL)
  TITLE     Annotation of the draft genome assembly of Bacillus
            weihenstephanensis KBAB4
  JOURNAL   Unpublished
REFERENCE   3 (bases 1 to 1011)
  AUTHORS   Lapidus,A., Goltsman,E., Copeland,A., Lucas,S., Barry,K.,
            Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E.,
            Tice,H., Bruce,D., Pitluck,S., Richardson,P., Auger,S.,
            Galleron,N., Sanchis,V., Gohar,M., Broussole,V., Brillard,J.,
            Guinebretiere,M.-H., Ehrlich,S.D., Lereclus,D., Aymerich,S.,
            Nuguen-the,C. and Sorokin,A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAR-2006) US DOE Joint Genome Institute, 2800
            Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA
COMMENT     URL -- http://www.jgi.doe.gov
            Contact: Paul Richardson (microbes@cuba.jgi-psf.org)
            Draft sequencing done at US DOE Joint Genome Institute
            Source DNA and bacteria available from Alexei Sorokin
            (alexei.sorokine@jouy.inra.fr)
            The JGI and collaborators endorse the principles for the
            distribution and use of large scale sequencing data adopted by the
            larger genome sequencing community and urge users of this data to
            follow them. It is our intention to publish the work of this
            project in a timely fashion and we welcome collaborative
            interaction on the project and analysis.
            (http://www.genome.gov/page.cfm?pageID=10506376)
            Notes:
            Bacillus weihenstephanensis KBAB4 was originally isolated from
            forest soil near Versailles, France, and was originally identified
            as Bacillus cereus (Vilas-Boas et al, Appl Env Microbiol, 2002, 68,
            1414). More detailed phylogenetic analysis, using MLST, of the
            Versailles Collection of B. cereus and B. thuringiensis strains
            revealed that the strain grows at low temperature (6oC) and
            clusters with many other strains able to grow at low temperatures,
            including the independently isolated and characterized strains
            WSBC10204 and WSBC10206 (Sorokin et al, Appl Env Microbiol, 2006,
            72, 1569). The latter two strains are the type strains of a newly
            recognized species in the B. cereus group able to grow in cold and
            having the species name Bacillus weihenstephanensis (Lechner et al,
            Int. J. Syst. Bacteriol, 1998, 48, 1373). Since, by MLST, KBAB4 is
            very closely related to WSBC10204 and represents many other
            psychrotrophic strains, it was assigned a species name Bacillus
            weihenstephanensis and the strain KBAB4 should also be considered
            as a type representative of this species.
FEATURES             Location/Qualifiers
     source          1..1011
                     /organism="Bacillus weihenstephanensis KBAB4"
                     /mol_type="genomic DNA"
                     /strain="KBAB4"
                     /db_xref="taxon:315730"
     gene            1..1011
                     /locus_tag="BcerKBAB4DRAFT_2786"
     CDS             1..1011
                     /locus_tag="BcerKBAB4DRAFT_2786"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA polymerase III, delta subunit"
                     /protein_id="ZP_01184852.1"
                     /db_xref="GI:89206292"
                     /db_xref="InterPro:IPR005790"
                     /db_xref="InterPro:IPR010372"
                     /translation="MSDIHKKIKKKQFAPLYVLYGTEAYFINETIKLITTEALEEEDR
                     EFNVVTYDLEEAYLEDVVEDAHTLPFFGERKVILIKSPLFLTAQKEKLEQNIKILEEY
                     IGEPSPFSILVFVAPYEKLDERKKITKLLKKTADVIEANAMQVQDVQKWIVSRADEVH
                     VHIDNAAVSLLLELVGSNVTMLAKEMDKLTLYVGMGGEITPKLVTELVPKSVEQNVFA
                     LTEKVVKKDIAGAMQILDGLFTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIA
                     SHIGVHPYRVKLAMNQTKFFSFEELKKVIIELAEADYSMKTGKMDKKLVLEFFLMRLN
                     HM"
ORIGIN
1 atgagtgata tacataagaa gattaaaaag aaacagttcg ctccgttgta tgtactgtat
61 ggaacggaag cttattttat aaatgaaacg ataaagctta ttacaacgga agcgcttgaa 
121 gaggaagatc gagagtttaa tgttgtgaca tacgatttgg aagaagcgta tttagaagat 
181 gtagttgagg atgcgcatac gcttcctttt tttggagagc gtaaagttat attaataaaa 
241 tcaccactat ttttaacggc acaaaaagaa aaattagaac aaaatataaa aattttagaa 
301 gaatatattg gggagccatc tcctttttct attcttgttt ttgttgcgcc ttatgaaaaa 
361 ttagacgaac gaaaaaaaat tacaaaacta ttaaagaaaa cagcggatgt aatagaagca 
421 aatgcgatgc aagtgcagga tgttcagaag tggattgttt ctcgtgcgga tgaggtgcat 
481 gtgcatattg ataatgcagc tgttagttta ttgttagagc ttgtgggaag caatgtaaca 
541 atgttggcga aggaaatgga caagttaacg ttatacgtcg gtatgggcgg agaaattacg 
601 ccaaaacttg tcacggaact tgtgccgaaa tctgttgagc aaaacgtgtt tgctttaaca 
661 gaaaaagtgg tgaaaaaaga tatcgctggt gcgatgcaaa ttttagacgg attatttaca 
721 cagcaggaag aaccgattaa actgctcgct ttattagtaa gtcagttccg cttgctgcat 
781 caagtgaaag agttgcaaca gcgtggttac ggacaaaatc aaatcgcgtc ccatattggt 
841 gtacatccgt accgggtaaa gttggcgatg aatcaaacga agtttttctc ttttgaagaa 
901 ttaaaaaaag ttattataga attggcggaa gctgattata gtatgaagac tggaaagatg 
961 gataagaaac ttgtgcttga gtttttctta atgcggttaa atcatatgta g

1: Z99117. Reports Bacillus subtilis...[gi:32468796]

LOCUS       BSUB0014 1044 bp DNA linear BCT 18-APR-2005
DEFINITION  Bacillus subtilis complete genome (section 14 of 21): from 2613658
            to 2812830.
ACCESSION   Z99117 REGION: complement(21662..22705)
VERSION     Z99117.2 GI:32468796
KEYWORDS    .
SOURCE      Bacillus subtilis subsp. subtilis str. 168
  ORGANISM  Bacillus subtilis subsp. subtilis str. 168
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1 (bases 1 to 1044)
  AUTHORS   Kunst,F., Ogasawara,N., Moszer,I., Albertini,A.M., Alloni,G.,
            Azevedo,V., Bertero,M.G., Bessieres,P., Bolotin,A., Borchert,S.,
            Borriss,R., Boursier,L., Brans,A., Braun,M., Brignell,S.C.,
            Bron,S., Brouillet,S., Bruschi,C.V., Caldwell,B., Capuano,V.,
            Carter,N.M., Choi,S.K., Codani,J.J., Connerton,I.F., Cummings,N.J.,
            Daniel,R.A., Denizot,F., Devine,K.M., Dusterhoft,A., Ehrlich,S.D.,
            Emmerson,P.T., Entian,K.D., Errington,J., Fabret,C., Ferrari,E.,
            Foulger,D., Fritz,C., Fujita,M., Fujita,Y., Fuma,S., Galizzi,A.,
            Galleron,N., Ghim,S.Y., Glaser,P., Goffeau,A., Golightly,E.J.,
            Grandi,G., Guiseppi,G., Guy,B.J., Haga,K., Haiech,J., Harwood,C.R.,
            Henaut,A., Hilbert,H., Holsappel,S., Hosono,S., Hullo,M.F.,
            Itaya,M., Jones,L., Joris,B., Karamata,D., Kasahara,Y.,
            Klaerr-Blanchard,M., Klein,C., Kobayashi,Y., Koetter,P.,
            Koningstein,G., Krogh,S., Kumano,M., Kurita,K., Lapidus,A.,
            Lardinois,S., Lauber,J., Lazarevic,V., Lee,S.M., Levine,A., Liu,H.,
            Masuda,S., Mauel,C., Medigue,C., Medina,N., Mellado,R.P.,
            Mizuno,M., Moestl,D., Nakai,S., Noback,M., Noone,D., O'Reilly,M.,
            Ogawa,K., Ogiwara,A., Oudega,B., Park,S.H., Parro,V., Pohl,T.M.,
            Portetelle,D., Porwollik,S., Prescott,A.M., Presecan,E., Pujic,P.,
            Purnelle,B., Rapoport,G., Rey,M., Reynolds,S., Rieger,M.,
            Rivolta,C., Rocha,E., Roche,B., Rose,M., Sadaie,Y., Sato,T.,
            Scanlan,E., Schleich,S., Schroeter,R., Scoffone,F., Sekiguchi,J.,
            Sekowska,A., Seror,S.J., Serror,P., Shin,B.S., Soldo,B.,
            Sorokin,A., Tacconi,E., Takagi,T., Takahashi,K., Takemaru,K.,
            Takeuchi,M., Tamakoshi,A., Tanaka,T., Terpstra,P., Tognoni,A.,
            Tosato,V., Uchiyama,S., Vandenbol,M., Vannier,F., Vassarotti,A.,
            Viari,A., Wambutt,R., Wedler,E., Wedler,H., Weitzenegger,T.,
            Winters,P., Wipat,A., Yamamoto,H., Yamane,K., Yasumoto,K., Yata,K.,
            Yoshida,K., Yoshikawa,H.F., Zumstein,E., Yoshikawa,H. and
            Danchin,A.
  TITLE     The complete genome sequence of the gram-positive bacterium
            Bacillus subtilis
  JOURNAL   Nature 390 (6657), 249-256 (1997)
   PUBMED   9384377
REFERENCE   2 (bases 1 to 1044)
  AUTHORS   Kunst,F., Ogasawara,N., Yoshikawa,H. and Danchin,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUN-2003) I. Moszer, A. Danchin, Institut Pasteur,
            Genetique des Genomes Bacteriens, 28 rue du Docteur Roux, 75724
            Paris Cedex 15, FRANCE. E-mail: moszer@pasteur.fr,
            adanchin@pasteur.fr Phone: +33 (0)1 45 68 84 41, Fax: +33 (0)1 45
            68 89 48
COMMENT     On Jul 7, 2003 this sequence version replaced gi:2634966.
            This entry contains data from release R16.1 of the SubtiList
            database. Further data on gene annotation and detailed information
            about changes from previous releases can be found at
            http://genolist.pasteur.fr/SubtiList/.
FEATURES             Location/Qualifiers
     source          1..1044
                     /organism="Bacillus subtilis subsp. subtilis str. 168"
                     /mol_type="genomic DNA"
                     /strain="168"
                     /db_xref="taxon:224308"
     gene            1..1044
                     /gene="yqeN"
                     /locus_tag="BSU25560"
     CDS             1..1044
                     /gene="yqeN"
                     /locus_tag="BSU25560"
                     /function="unknown"
                     /note="similar to unknown proteins"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAB14498.1"
                     /db_xref="GI:2635002"
                     /db_xref="GOA:P54459"
                     /db_xref="InterPro:IPR008921"
                     /db_xref="SubtiList:BG11642"
                     /db_xref="UniProtKB/Swiss-Prot:P54459"
                     /translation="MVFDVWKSLKKGEVHPVYCLYGKETYLLQETVSRIRQTVVDQET
                     KDFNLSVFDLEEDPLDQAIADAETFPFMGERRLVIVKNPYFLTGEKKKEKIEHNVSAL
                     ESYIQSPAPYTVFVLLAPYEKLDERKKLTKALKKHAFMMEAKELNAKETTDFTVNLAK
                     TEQKTIGTEAAEHLVLLVNGHLSSIFQEIQKLCTFIGDREEITLDDVKMLVARSLEQN
                     IFELINKIVNRKRTESLQIFYDLLKQNEEPIKIMALISNQFRLILQTKYFAEQGYGQK
                     QIASNLKVHPFRVKLAMDQARLFSEEELRLIIEQLAVMDYEMKTGKKDKQLLLELFLL
                     QLLKRNEKNDPHY"
ORIGIN



1 atggtatttg atgtgtggaa aagcctgaaa aaaggagagg tccatccggt ttattgttta 
61 tacggaaaag agacatatct gctgcaagaa accgtcagca ggatcagaca gacggtcgtt 
121 gatcaggaga caaacrgattt caatctgtcc gtttttgatc tggaagagga cccgctggat 
181 caagcgattg cagatgctga aacgtttccg tttatggggg agcggcgtct tgtcattgtg 
241 aaaaatccat attttttaac aggtgaaaag aaaaaagaaa aaattgagca caacgtcagt 
301 gcgcttgaat catatataca atcacctgcg ccttacacag tctttgtcct gcttgctccg 
361 tatgaaaagc ttgatgagcg aaaaaagctg acgaaagcgt taaagaagca cgcgtttatg 
421 atggaggcga aggaattaaa tgcaaaggaa acgacagact ttactgtcaa ccttgcaaaa 
481 acagagcaga aaacaatcgg cacggaagcg gcggagcatt tggttctgct tgtgaacggt 
541 catctgtcat cgatttttca ggagattcaa aagctctgca cgtttattgg agatcgtgaa 
601 gaaattacgc tggatgatgt aaaaatgctt gttgctagaa gccttgaaca aaatattttt 
661 gagctgatca ataaaatcgt caaccgaaaa cgaacagaga gtctgcaaat tttttatgat 
721 ttgctaaaac aaaatgaaga accgatcaaa attatggcgc ttatttcgaa tcagttccgg 
781 ctgattctgc agacgaagta cttcgcggaa cagggatacg gacaaaaaca aatcgcttct 
841 aatctaaaag ttcacccatt tcgggtaaag ctggcgatgg atcaagcaag gcttttttca
901 gaggaagagc tccgtttaat tattgaacag cttgccgtca tggactatga gatgaaaacc
961 gggaaaaagg acaagcagct cctgctcgaa ctgttccttt tacagctgtt aaaaagaaat 
1021 gaaaaaaacg atccccatta ttga

1: CP000002. Reports Bacillus lichenif...[gi:56160984]

LOCUS       CP000002 1065 bp DNA linear BCT 03-DEC-2004
DEFINITION  Bacillus licheniformis ATCC 14580, complete genome.
ACCESSION   CP000002 REGION: 2643434..2644498
VERSION     CP000002.2 GI:56160984
KEYWORDS    .
SOURCE      Bacillus licheniformis ATCC 14580 (DSM 13)
  ORGANISM  Bacillus licheniformis ATCC 14580
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1 (bases 1 to 1065)
  AUTHORS   Rey,M.W., Ramaiya,P., Nelson,B.A., Brody-Karpin,S.D.,
            Zaretsky,E.J., Tang,M., de Leon,A.L., Xiang,H., Gusti,V.,
            Clausen,I.G., Olsen,P.B., Rasmussen,M.D., Andersen,J.T.,
            Jorgensen,P.L., Larsen,T.S., Sorokin,A., Bolotin,A., Lapidus,A.,
            Galleron,N., Ehrlich,S.D. and Berka,R.M.
  TITLE     Complete genome sequence of the industrial bacterium Bacillus
            licheniformis and comparisons with closely related Bacillus species
  JOURNAL   Genome Biol. 5 (10), R77 (2004)
   PUBMED   15461803
REFERENCE   2 (bases 1 to 1065)
  AUTHORS   Berka,R.M., Rey,M.W. and Ramaiya,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-JUL-2004) Novozymes Biotech Inc, 1445 Drew Ave,
            Davis, CA 95616, USA
REFERENCE   3 (bases 1 to 1065)
  AUTHORS   Berka,R.M., Rey,M.W. and Ramaiya,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-SEP-2004) Novozymes Biotech Inc, 1445 Drew Ave,
            Davis, CA 95616, USA
  REMARK    Sequence update by submitter
COMMENT     On Dec 1, 2004 this sequence version replaced gi:52001702.
FEATURES             Location/Qualifiers
     source          1..1065
                     /organism="Bacillus licheniformis ATCC 14580"
                     /mol_type="genomic DNA"
                     /strain="ATCC 14580; DSM 13"
                     /db_xref="ATCC:14580"
                     /db_xref="taxon:279010"
                     /focus
     gene            complement(1..1065)
                     /gene="holA"
                     /locus_tag="BL02088"
     CDS             complement(1..1065)
                     /gene="holA"
                     /locus_tag="BL02088"
                     /note="DNA polymerase III (delta subunit)"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA polymerase III clamp loader subunit,
                     C-terminal"
                     /protein_id="AAU24258.1"
                     /db_xref="GI:52004316"
                     /translation="MKEGKVQMTVFDVWKSLKKGDIHPVYCLYGKETHLLQETVQRIR
                     QAVVDEETKDFNFSIFDLEEDALDLAVEDAETFPFMGERRLVVAKNPAFLTAEKKKDK
                     LEHNLGVLEAYITQPAPYSVFVLLAPYEKLDERKKLTKLLKKHAHMVEAKELTAKETA
                     DFIAGLAKSEGKQIEPEAAEELVLLCHASLSAIAQEVKKLSTYTGDRGEITIEDVRKL
                     VARGLEQNIFELINKVVNRKRTEALQIFYDLLKQNEEPIKMMALIANQFRLLMQTKYF
                     SDQGYGQKQIASNLKVHPFRVKLAIEQARLFSEQELRNIIEQLAVMDYEMKTGKKDKQ
                     LLLELFLLKLLQPHEKNGLL"

ORIGIN a ccgttttttt catgaggctg cagcagcttc agcaaaaaca gttcaagcag

61 aagctgcttg tcttttttgc ctgtcttcat ttcataatcc atcacggcga gctgctcgat 
121 aatgttccta agctcctgct ctgaaaaaag cctcgcctgc tcgatcgcca gcttcaccct
181 gaacgggtgc actttaaggt ttgaggcgat ttgcttttgt ccgtatccct ggtcagaaaa 
241 atatttggtc tgcatcagca gcctgaactg attagcaatc agcgccatca ttttaatcgg 
301 ctcttcattt tgctttaata gatcgtagaa aatctgcagc gcttctgtac gtttccgatt 
361 gacgacctta ttgatcagtt cgaaaatatt ttgttcaagc cctctcgcta caagctttct 
421 gacatcctct atggtgatct ctccgcggtc accggtatat gtactgagct ttttaacctc 
481 ctggaoaatg gctgacagac ttgcatgaca taacaggaca agctcctccg ccgcttccgg 
541 ttcgatttgt ttgccttccg acttcgcgag gcctgcgata aagtcagccg tttctttggc 
601 ggttaattcc ttcgcttcca ccatgtgcgc atgctttttc aggagcttgg tcagcttttt
661 ccgttcatcg agcttttcat aaggcgcaag caaaacaaac acggaataag gtgccggctg
721 cgLatgtaa gcttccagta caccgagatt atgctcaagc ttatccttct ttttttcagc 
781 tgttaaaaac gccgggtttt tcgccacgac aagacgcctt tcccccataa acggaaaggt 
841 ttccgcatcc tccacggcaa gatcgagcgc gtcttcttcc agatcaaata tcgagaagtt 
901 aaagtctttc gtctcttcat caacaacggc ttgtctgatc ctttgaaccg tctcctgcag 
961 aagatgcgtc tctttcccgt acaagcaata aacgggatga atgtcgcctt ttttcaagct 
1021 tttccaaaca tcaaataccg tcatctgcac tttcccctct ttcat

LOCUS       BA000004 1029 bp DNA linear BCT 01-DEC-2004
DEFINITION  Bacillus halodurans C-125 DNA, complete genome.
ACCESSION   BA000004 REGION: 1421231..1422259
VERSION     BA000004.3 GI:47118318
KEYWORDS    .
SOURCE      Bacillus halodurans C-125
  ORGANISM  Bacillus halodurans C-125
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1
  AUTHORS   Takami,H.
  TITLE     Genome analysis of facultatively alkaliphilic Bacillus halodurans
            C-125
  JOURNAL   (in) Horikoshi,K. and Tsujii,K. (Eds.);
            EXTREMOPHILES IN DEEP-SEA ENVIRONMENTS: 249-284;
            Springer-Verlag (1999)
REFERENCE   2
  AUTHORS   Takami,H. and Horikoshi,K.
  TITLE     Reidentification of facultatively alkaliphilic Bacillus sp. C-125
            to Bacillus halodurans
  JOURNAL   Biosci. Biotechnol. Biochem. 63, 943-945 (1999)
REFERENCE   3
  AUTHORS   Takami,H., Nakasone,K., Hirama,C., Takaki,Y., Masui,N., Fuji,F.,
            Nakamura,Y. and Inoue,A.
  TITLE     An improved physical and genetic map of the genome of alkaliphilic
            Bacillus sp. C-125
  JOURNAL   Extremophiles 3 (1), 21-28 (1999)
   PUBMED   10086841
REFERENCE   4
  AUTHORS   Takami,H., Nakasone,K., Ogasawara,N., Hirama,C., Nakamura,Y.,
            Masui,N., Fuji,F., Takaki,Y., Inoue,A. and Horikoshi,K.
  TITLE     Sequencing of three lambda clones from the genome of alkaliphilic
            Bacillus sp. strain C-125
  JOURNAL   Extremophiles 3 (1), 29-34 (1999)
   PUBMED   10086842
REFERENCE   5
  AUTHORS   Takami,H., Takaki,Y., Nakasone,K., Hirama,C., Inoue,A. and
            Horikoshi,K.
  TITLE     Sequence analysis of a 32-kb region including the major ribosomal
            protein gene clusters from alkaliphilic Bacillus sp. strain C-125
  JOURNAL   Biosci. Biotechnol. Biochem. 63 (2), 452-455 (1999)
   PUBMED   10192928
REFERENCE   6
  AUTHORS   Takami,H., Masui,N., Nakasone,K. and Horikoshi,K.
  TITLE     Replication origin region of the chromosome of alkaliphilic
            Bacillus halodurans C-125
  JOURNAL   Biosci. Biotechnol. Biochem. 63 (6), 1134-1137 (1999)
   PUBMED   10427704
REFERENCE   7
  AUTHORS   Takami,H., Takaki,Y., Nakasone,K., Sakiyama,T., Maeno,G.,
            Sasaki,R., Hirama,C., Fuji,F. and Masui,N.
  TITLE     Genetic analysis of the chromosome of alkaliphilic Bacillus
            halodurans C-125
  JOURNAL   Extremophiles 3 (3), 227-233 (1999)
   PUBMED   10484179
REFERENCE   8
  AUTHORS   Takami,H. and Horikoshi,K.
  TITLE     Analysis of the genome of an alkaliphilic Bacillus strain from an
            industrial point of view
  JOURNAL   Extremophiles 4 (2), 99-108 (2000)
   PUBMED   10805564
REFERENCE   9
  AUTHORS   Nakasone,K., Masui,N., Takaki,Y., Sasaki,R., Maeno,G., Sakiyama,T.,
            Hirama,C., Fuji,F. and Takami,H.
  TITLE     Characterization and comparative study of the rrn operons of
            alkaliphilic Bacillus halodurans C-125
  JOURNAL   Extremophiles 4 (4), 209-214 (2000)
   PUBMED   10972189
REFERENCE   10
  AUTHORS   Takami,H., Nakasone,K., Takaki,Y., Maeno,G., Sasaki,R., Masui,N.,
            Fuji,F., Hirama,C., Nakamura,Y., Ogasawara,N., Kuhara,S. and
            Horikoshi,K.
  TITLE     Complete genome sequence of the alkaliphilic bacterium Bacillus
            halodurans and genomic sequence comparison with Bacillus subtilis
  JOURNAL   Nucleic Acids Res. 28 (21), 4317-4331 (2000)
   PUBMED   11058132
REFERENCE   11 (bases 1 to 1029)
  AUTHORS   Takami,H. and Takaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2000) Hideto Takami, Japan Marine Science and
            Technology Center, Deep-sea Microorganisms Research Group; 2-15
            Natsushima, Yokosuka, Kanagawa 237-0061, Japan
            (E-mail:takamih@jamstec.go.jp,
            URL:http://www.jamstec.go.jp/jamstec-e/bio/DEEPSTAR/FResearch.html,
            Tel:81-468-67-3895, Fax:81-468-66-6364)
COMMENT     On or before Dec 8, 2004 this sequence version replaced
            gi:10172612, gi:10172890, gi:10173176, gi:10173440, gi:10173727,
            gi:10174030, gi:10174345, gi:10174613, gi:12641879, gi:10175192,
            gi:10175500, gi:10175792, gi:10176109, gi:10176401.
FEATURES             Location/Qualifiers
     source          1..1029
                     /organism="Bacillus halodurans C-125"
                     /mol_type="genomic DNA"
                     /strain="C-125"
                     /db_xref="taxon:272558"
                     /note="alkaliphile"
     gene            1..1029
                     /gene="BH1337"
     CDS             1..1029
                     /gene="BH1337"
                     /note="BH1337
                     unknown conserved protein"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="BAB05056.1"
                     /db_xref="GI:10173953"
                     /translation="MNYLKLKQDVQAGRVAPVYFMYGTELFLMEDLIQDILSVTLSDE
                     ERDMNVSSYSLTDVPIEAALEEAETVPFFGSKRVVILKDAALFTSQKLDVEHDVKRLE
                     QYILNPVPETVLLIMAPYEKLDERKKITKLIKKESLVLEAKPLDEGEAKAWLSSLASE
                     LQVEMDEKAIETLLGMTGLRLTQLASEMNKLALYVGEGGIIRSEDVTLLVAKTLDQNI
                     FDLIDFAINQRTHQALSLYHELLKQKEEPLKLLALLTRQFRIMYQVKELGRRGYTPNQ
                     MAKPLKIHPYVAKLAGKKAASMSDQLLYSLIEKAADTEFAIKSGKVDKVLALELFLLT
                     MGSRESVG"

ORIGIN 

1 atgaactatt taaaattaaa gcaagatgtc caagcgggac gagtggcccc tgtatatttt 
61 atgtatggga ccgagttgtt tttaatggaa gacctcatac aggacatttt gtccgtaacg
121 ttatcggatg aggaaaggga tatgaatgtt tcatcttatt cccttactga tgttccgatt 
181 gaggcggcgc tcgaagaagc agaaacggtt cctttttttg gctcgaagcg ggtcgtgatc 
241 ttaaaagatg ctgccctatt tacaagccaa aagcttgatg ttgagcacga tgtgaaacga 
301 ttggagcaat acatattgaa cccagttcca gagacggttt tgctgatcat ggccccttat 
361 gaaaagctcg atgaacgaaa aaaaatcacg aagcttatca aaaaagagtc gcttgttctt 
421 gaggcaaagc ctttggacga aggagaggca aaagcatggc tatcctcact ggcaagcgag
481 cttcaggtgg agatggacga gaaagctatc gagacgttgt tagggatgac gggtcttcgc
541 ttgacccaat tagcatctga aatgaataaa ttagcgttgt atgtagggga aggaggcatc 
601 atccgatcag aggacgtaac cttgctcgtt gctaaaacgc tcgatcaaaa catatttgat 
661 ctgattgatt ttgccatcaa tcagaggact catcaagcgc tctctcttta ccatgagctc 
721 ctgaaacaaa aagaggagcc attgaagctg ttggccttat taacacggca gtttcgcatc 
781 atgtaccaag tcaaagagct tggacgaaga ggatacaccc cgaatcagat ggcgaagccg 
841 ttaaaaattc atccctatgt tgccaaactt gcgggaaaaa aagcagccag catgtcagac 
901 caactgttat attccctaat cgaaaaagct gcggatacag agtttgcgat taaaagtgga 
961 aaggttgata aagtgctggc attagaattg tttttactca caatgggttc tagggagagc 
1021 gtaggttag

//

1: AP006627. Reports Bacillus clausii...[gi:56908016]

LOCUS       AP006627 1023 bp DNA linear BCT 28-JUL-2006
DEFINITION  Bacillus clausii KSM-K16 DNA, complete genome.
ACCESSION   AP006627 REGION: 1778796..1779818
VERSION     AP006627.1 GI:56908016
KEYWORDS    .
SOURCE      Bacillus clausii KSM-K16
  ORGANISM  Bacillus clausii KSM-K16
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1
  AUTHORS   Hakamada,Y., Kobayashi,T., Hitomi,J., Kawai,S. and Ito,S.
  TITLE     Molecular cloning and nucleotide sequence of the gene for an
            alkaline protease from the alkalophilic Bacillus sp. KSM-K16
  JOURNAL   J. Ferment. Bioeng. 78, 105-108 (1994)
REFERENCE   2
  AUTHORS   Kobayashi,T., Hakamada,Y., Adachi,S., Hitomi,J., Yoshimatsu,T.,
            Koike,K., Kawai,S. and Ito,S.
  TITLE     Purification and properties of an alkaline protease from
            alkalophilic Bacillus sp. KSM-K16
  JOURNAL   Appl. Microbiol. Biotechnol. 43 (3), 473-481 (1995)
   PUBMED   7632397
REFERENCE   3
  AUTHORS   Shirai,T., Suzuki,A., Yamane,T., Ashida,T., Kobayashi,T., Hitomi,J.
            and Ito,S.
  TITLE     High-resolution crystal structure of M-protease: phylogeny aided
            analysis of the high-alkaline adaptation mechanism
  JOURNAL   Protein Eng. 10 (6), 627-634 (1997)
   PUBMED   9278275
REFERENCE   4
  AUTHORS   Sakaki,Y., Kageyama,Y., Shimamura,S., Suzuki,H., Nishi,S.,
            Hatada,Y., Kawai,S., Ito,S. and Horikoshi,K.
  TITLE     The complete genome sequence of the alkaliphilic Bacillus clausii
            KSM-K16
  JOURNAL   Unpublished
REFERENCE   5 (bases 1 to 1023)
  AUTHORS   Sakaki,Y., Kageyama,Y., Shimamura,S., Suzuki,H., Nishi,S.,
            Hatada,Y., Kawai,S., Ito,S. and Horikoshi,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2003) Yasushi Kageyama, Kao Corporation,
            Biological Science Laboratories; 2606 Akabane, Ichikai-machi,
            Haga-gun, Tochigi 321-3497, Japan
            (E-mail:kageyama.yasushi@kao.co.jp, URL:http://www.kao.co.jp/e/,
            Tel:81-285-68-7516, Fax:81-285-68-7547)
FEATURES             Location/Qualifiers
     source          1..1023
                     /organism="Bacillus clausii KSM-K16"
                     /mol_type="genomic DNA"
                     /strain="KSM-K16"
                     /db_xref="taxon:66692"
                     /note="alkaliphile"
     gene            1..1023
                     /gene="holA"
                     /locus_tag="ABC1650"
     CDS             1..1023
                     /gene="holA"
                     /locus_tag="ABC1650"
                     /EC_number="2.7.7.7"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA polymerase III delta subunit HolA"
                     /protein_id="BAD64185.1"
                     /db_xref="GI:56909658"
                     /translation="MSIITEMKQLSSETMAPVYFLYGTERYLIEQFLQKASEALFAGE
                     EGEMNDIRLRLQDTPLGEVVEEAETVPFFSKRKLVVIQDFYLATSQKVATKLEHDIPA
                     LEAYLDSPAQETVLVVVAPYEKLDERKKITKKLKKQTSAIDVNPFDEKETFAWMEAIA
                     KQEGVAIEQDAKQLLYKRTGPNLMLLHRELEKCMLYSKDGDPVTASIVDELVAETVEQ
                     SVFTVIDYTAKGRAGDAVRLYHQLLRQKEEPLAILALLTRQFRQFFQVKTRLAKGYTQ
                     KEMASQLKLHPYVVKLAMQQVKALTTKQLRQALIVCEETDYAIKTGQADKELAVELLL
                     LRLAST"
ORIGIN

1 atgagcatca ttacggaaat gaagcaattg tcgagtgaga caatggcacc ggtatacttt 

61 ttgtatggga cagaacgtta tttaatcgag caatttttac aaaaagcaag cgaagcatta 

121 tttgcagggg aagaaggcga aatgaatgat attcgcttac gcttacaaga tacgccgcta 

181 ggggaagtgg tggaggaagc ggagactgtt ccgtttttct caaagcggaa acttgttgtg 

241 atccaagact tttatttggc aacgagccag aaagtagcga caaaactgga gcatgacatt 

301 cccgcactgg aagcctatct cgactcgcct gcacaagaaa ccgtgcttgt cgttgtcgcg

361 ccgtatgaaa aattagatga acgcaagaaa atcaccaaaa aattaaaaaa acaaacgagc 

421 gctattgatg tgaatccatt tgatgaaaag gaaacatttg cgtggatgga ggcgatagct 

481 aaacaggagg gcgttgcgat cgagcaagac gcgaagcaat tgctgtataa gcgtacaggc 

541 cctaacttga tgttgttgca ccgtgaattg gagaagtgca tgttgtatag caaagacggg 

601 gacccagtca cagcaagcat tgtcgacgaa ctcgttgctg agacagtgga acaaagtgta 

661 tttactgtta ttgactacac ggcgaaaggc cgtgctggtg acgcggtccg gctttaccac 

721 cagctgttga ggcaaaaaga agagccgctg gccatactgg cgttgttgac gcggcaattt 

781 cgacaatttt tccaggtgaa aacgaggcta gccaaagggt acacgcaaaa agagatggca 

841 tcacagctta aattgcatcc ttatgtagta aagctcgcca tgcagcaagt caaagcgtta 

901 acgaccaagc aattaagaca agctcttatc gtttgcgaag aaactgacta tgccattaag 

961 acagggcagg ccgataaaga gttggccgtc gagctgttgc tgctgcgatt ggcaagcacg 

1021 taa 

// 

########################################
# Program: needle
# Rundate: Wed Aug 23 05:38:23 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-05382260401273.output
########################################

#=======================================
#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: G_kaustophilus
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1047
# Identity: 1030/1047 (98.4%)
# Similarity: 1030/1047 (98.4%)
# Gaps: 3/1047 ( 0.3%)
# Score: 6325.0
#
#











SEQ_177            1 ATGCTGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCCT     50
                     ||||||||||||||||||||||||||||||||||||||||||||||..||
G_kaustophilu      1 atgctggaacgcgtatggggaaacattgaaaaacggcgtttttctcttct     50

SEQ_177           51 TTATTTATTATACGGCAATGAGCCGTTTTTATTAACGGAAACGTATGAGC    100
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu     51 ttatttattatacggcaatgagccgtttttattaacggaaacgtatgagc    100

SEQ_177          101 GATTGGTGAACGCAGCGCTTGGCCCGGAGGAGCGGGAGTGGAACTTGGCT    150
                     |||||||||||||||||||||||||.||||||||||||||||||||||||
G_kaustophilu    101 gattggtgaacgcagcgcttggccccgaggagcgggagtggaacttggct    150

SEQ_177          151 GTGTACGACTGCGAGGAAACGCCGATCGAGGCGGCGCTTGAGGAGGCCGA    200
                     ||||||||||||||||||||||||.|||||||||||||||||||||||||
G_kaustophilu    151 gtgtacgactgcgaggaaacgccggtcgaggcggcgcttgaggaggccga    200

SEQ_177          201 GACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGCATCCAT    250
                     |||||||||||||||||||||||||||||||||.||||||||||||||||
G_kaustophilu    201 gacggtgccgtttttcggcgagcggcgtgtcatcctcatcaagcatccat    250

SEQ_177          251 ATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGGCGAAG    300
                     |||||||.||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    251 attttttcacgtctgaaaaagagaaggagatcgaacatgatttggcgaag    300

SEQ_177          301 CTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGATCGTCGTCTTTTT    350
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    301 ctggaggcgtacttgaaggcgccgtcgccgttttcgatcgtcgtcttttt    350

SEQ_177          351 CGCGCCGTACGAGAAGCTTGATGAGCGAAAAAAAATTACGAAGCTCGCCA    400
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    351 cgcgccgtacgagaagcttgatgagcgaaaaaaaattacgaagctcgcca    400

SEQ_177          401 AAGAGCAAAGCGAAGTCGTCATCGCCGCCCCGCTCGCCGAAGCGGAGCTG    450
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    401 aagagcaaagcgaagtcgtcatcgccgccccgctcgccgaagcggagctg    450

SEQ_177          451 CGTGCCTGGGTGCGGCGCCGCATCGAGAGCCAAGGGGCGCAAGCAAGCGA    500
                     ||.|||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    451 cgcgcctgggtgcggcgccgcatcgagagccaaggggcgcaagcaagcga    500

SEQ_177          501 CGAGGCGATTGATGTCCTGTTGCGGCGGGCCGGGACGCAGCTTTCCGCCT    550
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    501 cgaggcgattgatgtcctgttgcggcgggccgggacgcagctttccgcct    550

SEQ_177          551 TGGCGAATGAAATCGATAAATTGGCCCTGTTTGCCGGATCGGGCGGAACC    600
                     ||||||||||||||||||||||||||||||||||||||||||||||||||
G_kaustophilu    551 tggcgaatgaaatcgataaattggccctgtttgccggatcgggcggaacc    600
SEQ__177 


601 


AT CGAGGC GGCGG CGGTTGAG CGGC TTGT CG C CCGCACGCCGGAAGAAAA 

1 1 1 1 1 1 E 1 i E 1 1 E E 1 i 1 1 1 1 * 1 1 1 1 f I E 1 1 1 1 1 1 1 1 1 i 1 i 1 1 1 1 1 1 1 1 E 1 

atcgaggcggcggcggttgaacggcttgtcgcccgcacgccggaagaaaa 


650 


G_kaustophilu 


601 


650 


SEQ_17 7 
G_kaus tophi lu 


651 
651 


CGTATTTGTGCTT GTC GAGCAAGTGG CGAAGCG CGACATT C CAGCAG CGT 

1 1 1 1 M 1 1 1 1 ! 11 M M 1 f 11 1 M 1 E 1 M M M 1 M I i M M ! H - 1 E 1 ! 

cgtatttgtgcttgtcgagcaagtggcgaagcgcgacattccagcggcgt 


700 
700 


SEQ__177 
G__kaus tophi lu 


701 
701 


TG CAGACGTTTTATGATCTG CTTG AAAAC AATGAAGAG C CGAT CAAAATT 

Mllljl!IIN!jMI!lll!l!llllMllllll!lllll!llll! 

tgcagacgttttatgatctgcttgaaaacaatgaagagccgatcaaaatt 


750 
750 


SEQ_17 7 


751 


TTGGCGTTGCTCGCCGCCCATTTCCGCTTGCTTTCGCAAGTGAAATGGCT 

lllllMMilMIIIIIMlllltlllMlllllllllllllllMIl 

ttggcgctgctcgccgcccatttccgcttgctttcgcaagtgaaatggct 


800 


G__kaustophilu 


751 


800 


SEQ_177 


801 


TG C CTC C TTAGGC TAG GGACAGGC G CAAATTGCTGCGGCG CT CAAGGTG C 

[MIIIMIlllINHI.lillNIIIIMIllllMIMIIIIMIIl 

tgcctccttaggctacgggcaggcgcaaattgctgcggogctcaaggtgc 


850 


G kaus tophi lu 


801 


850 


SSQ_177 


851 


ACCCGTTCCGCGTCAAGCTCGCTCTTGCTCAAGCGGCCCGCTTCGCTGAC 

1 1 1 1 1 E E i 1 1 i i i E 1 ! E I E 1 1 E I J 1 1 ! ! 1 1 E I i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 r 

acccgttccgcgtcaagctcgctcttgctcaagcggcccgcttcgctgac 


900 


G_kaus tophi lu 


851 


900 


SEQ__17 7 


901 


GGAGAGCTTGCTGAGGCGATCAACGAGCTCGCTGACGCCGATTACGAAGT 

1 M f M 1 1 1 1 1 1 1 1 1 i M 1 i 1 1 1 1 1 M r 1 M 1 E M 1 1 1 1 1 M r M 1 M f 1 

ggagagcttgctgaggcgatcaacgagctcgctgacgccgattacgaagt 


950 


G__kaus tophi lu 


901 


950 


SEQ_177 


951 


GAAAAGCGGGGCGGTCGATCGCCGGTTGGCCGTTGAGCTGCTTCTGATGC 

M 1 1 1 1 M ! < ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ai r I r 1 1 1 1 1 1 M M J M I f 1 M M I 

gaaaagcggagcggtcgatcgccggttggccgttgagctgcttctgatgc 


1000 


G_k.au s tophi! u 


951 


1000 


SEQ__177 


1001 


GCTGGGGCGCCCGCCCGGCGCAAGCGGGGCGCCACGGCCGGCGG 1044 

! . 1 1 1 1 N - 1 1 1 M t r ! 1 1 1 1 1 1 1 1 r M f r 1 - 1 1 H M 1 E E 1 E i 

ggtggggcacccgcccggcgcaagcggggcgtcacggccggcggtga 1047 


G__kau s t oph i 1 u 


1001 



######################################## 

# Program: needle
# Rundate: Wed Aug 23 05:48:17 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-05481683725941.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: Bacillus_sp_NRRL_B14911
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1090
# Identity: 648/1090 (59.4%)
# Similarity: 648/1090 (59.4%)
# Gaps: 95/1090 ( 8.7%)
# Score: 3106.0
#
#=======================================



SEQ_17 7 1 ATGCTGGAACGCG - TATGGGGAAAC ~ ATTGAAAAAC - GGCGTTTTTCTCC 4 7 

-III I Mi- HIM i-Mf-MI -,| — lll-l-fl 

Bacillus_sp__N 1 ttggttttagatgtgtggaagcaaataaataacaaacaatttacccc 4 7 

SEQ_1 77 48 CCTTTATTTATTATACGGCAATG - AGCCGTTTTTATTAACGGAAACGTAT 9 6 

MMMilMIMUI.MI IM-Mii-IIMi- lllll-l- 

Bacillus_sp_N 48 gatatatttattatacggaacagaaggctttttgat taat -gaaacaaaa 96 

SEQ_177 97 GAG CGATTGGTGAACG CAGCG CTTGGC C CCGAGGAGCGGGAGT GG AACTT 146 

.IIM-MMIM-I-IIM MMI IMM-MM* 

Bacillus_sp_N 97 cagctgcttatcaacaatgtgctgg ctgaaga-cgagagcgatttta 142 

SEQ__177 14 7 GGCTGTGTACGACTGC GAGGAAACGCCGATCGAGGCGGCGCTTGAG 192 

- . 1 1 1 1 - 1 M — 11- IIIIIIIMMMMIIIMIIill 

Bacillus_sp_N 143 atctgtctacatatgatttggaggaaacaccaatagaagcggcacttgag 192 

SEQ__177 193 GAGGCCGAGACGGTGCCGTTTTTCGGCG AGCGGCGTGTCATTCTCAT 239 

■ I I — I - I I - I J|-|f| ll-lfll 



Bacillus_sp_N 193 gatgcagagaccttccctttcatgggggataagaggctcatctttcttca 242 

SEQ_17 7 24 0 CAAG CATC CATATTTTTTTACGTCTG AAAA AGAGAAGGAGATCGAAC 28 6 

IN f IhMUM-M-IMIMI Ml. -MM. -Il-I 

Bacillus_sp_N 243 caa-c- -ccctcttttctgacttccgaaaagacaaaggggaagactgagc 289 

SEQ_17 7 28 7 ATGATTTGGCGAAGCTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCG 3 36 

l|.|MIIMIflilM--MMMhMINIIIIhMN- 

Bacillus_sp_N 290 ataatctggccaagctggaatcctatttaaaagagccgtcgccatattca 339 

SEQ_1 7 7 337 AT CGT CGT CTTTTTCG CGC CGTACGAGAAGCTTGATGAG CGAAAAAAAAT 386 

1 1 - • M M 1 1 - — I M M f - 1 M 1 1 1 M 1 1 1 1 - • 1 1 1 1 1 1 M I 

Bacillus_sp_N 34 0 attattgtgtttgcggcaccctatgaaaagctggatgatagaaaaaagat 38 9 

S EQ__1 77 3 8 7 TACGAAGCTCGCCAAAGAGCAAAGCGAAGTCGTCATCGCCGCC CCGC 43 3 

I ! I M If ^.MINIMUM MM IM-M--IM .--I 

Bacillus__sp_N 3 90 t acgaag - gaac t gaagaaaaaagc -aagt -gtccttgaggccaaaaaac 43 6 

SEQ_17 7 43 4 TCGCCGAAGC- GGAGCTGCGTGCCTGGGTGCGGCGCCGCATCGAGAGCCA 4 82 

- - M II I M II 1 1 - - • M 1 1 M I - ■ II I - 1 - M — I I 



Bacillus_sp_N 43 7 t -gggggagcaggagctgaaagtctggatcagg gaaagggctgccc 4 81 



SEQ_177 483 AGG--GGCG-CAAGCAAGCGACGAGGCGATTGATGTCCTG TTGCGG- 525 

fl- MM I - 1 1 — 1 1 1 1 • I • - 1 1 ■ M I M M MMM 

Bacillus_sp_jsr 4 82 agaacggcgtccagatt gaggaggatgccgtgcagcttatgctga 526 

SEQ__1 7 7 526 -- CGGGCCGGGAC - - GCAGCTTTCCGCCTTGGCGAATGAAATCGATAAAT 571 

i-IM-M-ll MMIIIMi M M ■ M I M M M I M M I 

Bacillus__sp_N 52 7 ctctggcaggtaccaacctctttatgc - - tgacaggtgaaattgataaat 574 

SEQ177 572 TGGCCCTGTTTGCCGGATCGGGCGGAACC - - ATCGAGGCGGCGGCGGTTG 619 

i 1 ! 1 1 1 1 1 1 i 1 1 - 1 1 - 1 - - - 1 1 - i MMIMMMMMM1 

Bac i 1 lus_sp__N 5 75 tggccttatatgccg- - acgaggaaaagctgattgatgccgaggcagt tg 622 

S SQ_1 77 620 AGCGGCTTGTCGCCCGCACGCCGGAAGAAAACGTATTTGTGCTTGTCGAG 6 69 

I — I- MMMIMMMM Mill M- 

Bacillus_sp_N 6 23 acaagctggttgccaggtcgcttgagcagaatattttcacccttgtcgac 6 72 

SEQ_17 7 6 70 CAAGTGGC GAAGC GC GACATT C CAG CAGC GTTGCAGACGTTTTATGAT CT 719 

Bacillus_sp_N 673 aaggtggtccacagaaaaatagatgaggcactgcggatttattatgatct 722 

SEQ___177 720 GCTTGAAAACAATGAAGAGCCGATCAAAATTTTGGGGTTGCTCGCCGCCC 769 

- - 1 - - 1 - 1 - 1 1 1 1 1 1 1 j i ! i i r i - 1 i 1 1 1 1 - 1 - -i^ i - 1 i - 1 1 i - 1 1 

Bacillus_sp_N 723 tttaaagcaaaatgaagagccgattaaaattctttccattctgaccggcc 772 

SEQ__177 77 0 ATTTCCGCTTGCTTTCGGAAGTGAAATGGCTTGCCTCCTTAGGCTACGGA 819 

[ * I! I - 1 - M ■ f ■ I - 1 1 - M 1 1 ! — Mil! MMIMM 

Bacillus ^spj 773 aattcaggctgatctaccaggtgaaggagctttcgagaaggggatacggc 822 

SEQ_1 77 82 0 CAGGC G CAAATTG CTG CGGC - GC - T CAAGGTGCAC C CGTT CC GCGT GAAG 867 

III- ■! IM1IIIMIMIMMMMIIIM 

Bacillus_sp__N 323 cagcagcagacggc- -cggatacttaaaggtccatcccttcagggtcaag 870 

S EQ__1 7 7 868 CTCGCTCTTGCTCAAGCGGCCCGC - - TTCGCTGACGGAGAGCTTGCTGAG 915 

M - i i — ! I i ^ i I fill MM--ii.i-lii-ii--l- 

Bacillus__sp_N 871 cttgcagccggacaggc- -ccagcttttcggagatgaagaactgtcc 915 

SEQ_177 916 GCG-ATCAACGAGC-TC--GCTGACGCCGATTACGAAGTGAAAAGCGGGG 961 

I I MM — Ml M I I M I M I M M M M M M I 1 1 1 1 Mill 

Baci21us_sp_N 916 -cgcatcatgaagcttctggcagaagcagattatgagatgaaaa-caggg 963 

SEQ_1 77 962 CGGTC GAT CG C CGGTTGGC - CGTTGAGCTGCTTCTGATGCGCTGGGGCGC 1010 

---Ml - - 1 1 1 N - 1 1 - 1 1 - 1 1 ■ I IM I 



Bacillus__sp_N 964 gggatgaacaaggaaatgcttattgagatgtttttgtt cag gc 1006 

SEQ_177 1011 CCGCCCGGCGCAAGCGGGGCGCCAC - GGCCGGCGG 1044 

-I MM li-M MMIM -MMM- 

Bacillus sp_N 1007 tc -cacgac -catgc cctccacaaacagacaaaataa 1041 



########################################
# Program: needle
# Rundate: Wed Aug 23 05:53:46 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-05534516331131.output
########################################

# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_cereus_subsp_cytotoxis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1072
# Identity: 582/1072 (54.3%)
# Similarity: 582/1072 (54.3%)
# Gaps: 89/1072 ( 8.3%)
# Score: 2732.0
#
#



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



1 ATGC - TGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCC 

III- 1 1 . - 1 - - i ■ I M 1 1 ■ 1 1 1 - 1 1 i I i - 1 1 1 - i 1 1 1 - - 

1 atgagtgatatacataag aaaattaaaaagaaacagtttgctccgt 

5 0 TTTATTTATTATACGGCAATG - AGCCGTTTTTATTAACGGAAACGTATGA 

Ililllllll-ll-IM-.l ! 1 1 1 ■ ■ 1 1 1 1 1 1 ■ 1 1 - 1 1 E I J -I 

4 7 tttatttattgtatggaacggaagcctattttataaat-gaaac aa 

9 9 GCGATTGGTGAAC GC AG CG CTTGG C CCCGAGGAGCGGGAGTGGAAC 

.Mill tl j^||-MM-l-^iMMMU-M- 

92 tcgatttaattacagtagaggcacttgccgaagaagatcgtgattttaat 
145 TTGGCTGTGTACGACTG CGAGGAAAC GC C GATCGAGG CGGCGCTTGAGGA 

- r - 1 - i — 1 1 ^ i I - 1 - - 1 1 - 1 1 - i I - - - f - 1 1 i I - 1 — I - 1 1 1 i i 

142 attgttacatatgatttagaagaggcgtatttagaggatgtagtagagga 
19 5 GGCCGAGACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGC 

.ll-.-ilM-l-li-iMMMMM I..I--MI-IM- 

192 tgcaaggacgcttccattctttggagagcgaaaaatattattaataaagt 
24 5 AT C GATATTTTTTTAC GTCTGAAAAAGAGAAGGAGATC GAACATGATTTG 

..|M..mil-IMI"MIMMI---l 1 1 - i i - - If I • 

242 ccccgttatttttaacatcacagaaagaaaaattg gagcaaaatata 

295 GCGAAGCTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGATCGTCGT 

.-.i-.ii.iin-.ii-.i-^.|..ii.ii.ii.iiiiiiiiM"i 

28 9 aaaatccttgaggaatatattgctgaaccatctccattttcgatcatgat 
3 4 5 CTTTTT CGCG CCGTACGAGAAGC TTGATGAG CGAAAAAAAATTACGAAGC 

. 1 1 1 ^ M 1 1 . 1 1 . if * E i - 1 1 1 < I 1! 1 1 1 9 1 ! i 1 1 1 E I M M 1 i 3 ! 1 1 

33 9 t tttatcgctccatatgaaaagttagatgagcgaaaaaaaattacgaag - 

3 95 TCGCCAAAGAGCAAAGC™ GAAGTCGTCATCGC CGCCCCGCTCGCCGA 

|....|||||.-l--ll ||.||.||....|| .||...||..|...| 

38 8 ttattaaagaaaacggctgaggtagtagaagcgaatgctatgcaagtaca 

4 41 AGCGGAG CTG CGTG C CTGGGTG CGG CGC CG CAT CGAGAGC C - AAGGGGCG 

ii..i i^i — 1 1 i - j - 1 - 1 - i - i - i I - IM--I-- 

43 8 agatg ttaggaaatggat tgtaaagcgtgctgaagaagtt 



49 
46 
98 
91 
144 
141 
194 
191 
244 
241 
294 
288 
344 
338 
394 
387 
440 
437 
489 
477 



5EG__177 

B_cereus_subs 

SEQ_177 

B_cereus_subs 

SEQ_177 

B_cereus_subs 

SEQ_177 

B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



SEQ 177 



B cereus subs 



490 CA- -AGCAAG-CGACGAGGCGATTG ATGTCCTGTTGCGGCGGGCCGG 533 

M -MM • M - 1 M I M • » 1 1 .|.|.||.|||..||..|..|| 

478 catgtgcatgttgaagaggcggctgtaagtttgcttttggagctcgtagg 527 

534 GACGCAGCTTTCCGCC - TTGGCGAATGAAATCGATAAATTGGCCCTGTTT 582 

II M|..-|..|... I M I M I M M M M 1 1 M M I M M M M M I 

5 28 ga-gcaatgtaacaatgttagcaaaggaaatggataagttaacactgtat 5 76 

5 83 GCCGGATCGGGCGGAACCATCGAGGCGGCGGCGGTTGAGCG GCTTGT 62 9 

MMMMMMMMMIM M-M III---I -Mill 

577 gttggaacgggcggagacatt acaacaaaacttgtaagtgaacttgt 623 

63 0 CGCCCGCACGCCG- - GAAGAAAACGTATTTGTGCTTGTG - GAGCAAGTGG 676 

MIMMMM ll.-MIMIMMIM 1 1 1 - I IMMMMi 

624 --accgaagtcagttgagcaaaatgtttttgc-cttaacagaaaaagtcg 670 

67 7 C GAAG CGC GACATT C CAG CAGC GTTGCAGACGTTTTATGAT CTGCTTGAA 72 6 

* M I I M I M M M M I M M M M M M M M M — M M M M 

6 71 taaaaaaagatatagcaggagcaatgaaaatattagatggcttatttaca 72 0 
727 AACAATGAAGAGCCGATCAAAATTTTGGCGTTGCTCGCCGCCCATTTCCG 776 

. I - - 1 - 1 i I i I - 1 1 - N 1 1 - M I - 1 M 1 1 1 1 1 - 1 MMMM 

721 caacaggaagaaccaattaaattattagcgttgctagtgagtcaattccg 770 

77 7 CTTGCTTTCGCAAGTG AAATGGCT - - TG C CT C CTTAGG CTACGGACAGG C 824 

M i - I M M M M M - M MMM - 1 1 1 - 1 1 1 1 1 1 1 1 - - - 

771 tttattacatcaagtaaaagaactacagcagc- -gaggatacggacaaaa 818 

825 GCAAATTGCTGCGGCGCTCAAGGTG - - CACCCGTT CCGCGTCAAGCTCGC 8 72 

MIMMMMI II -l-.-IMI I M M I M 1 1 M I M I M - M 1 1 

819 tcaaattgcttc-gc-atattggtgttcatccgtaccgagtaaaattggc 866 

873 TCTTGCTCAAGCGGCCCGCTTCGCTGACGGAGAGCTTGCTGAGGCGATC A 922 

. . I - - - I I II M M ■ ■ I * M I M I MMMM 1 1 M M I - 

867 gatgaatcaaacgaaacttttctcatttgaagaattgaaaaaggtaattt 916 

923 ACG AG CTC GC TG AC GC CGATTA C G AAGTGAAAAG CGGGG CGGT CGAT CG C 972 

MMMMMMMMMMM M M I M M M M 1 1 M - 

917 tcgagttagcggaagcggactatagtatgaaaacagggaaaatggataag 966 

973 CGGTTGGCCGTTGAGCTGCTTCTGATGCGCTGGGGCGCCCGCCCGGCGCA 10 2 2 

.-II MM.MMMM-M-MMM M M M • - M I 

967 aagttagttctagaatttttcttaatg agactgaatca 1004 



SEQ 177 



B cereus subs 1005 tatgtga 



1023 AGCGGGGCGCCACGGCCGGCGG 1044 



1011 



# Program: needle
# Rundate: Wed Aug 23 14:53:14 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-14531389444774.output
########################################

#=======================================
#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_anthracis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1066
# Identity: 585/1066 (54.9%)
# Similarity: 585/1066 (54.9%)
# Gaps: 77/1066 ( 7.2%)
# Score: 2717.0
#
#



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



1 ATGO TGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCC 

in. M-.!*-ui - 1 1 1 1 1 1 - 1 1 1 i i - - 1 1 - - 1 r i I I J I ^ 

1 atgagtgatatacata aaaagattaaaaaaaagcagtttgctccgt 

5 0 TTTATTTATTATAC GG CAAT GAGC C GTTTTTATTAACGGAAAC GTATGAG 

I.MIII|.|-if.il-l'-tl--miiM-IM-Nllf|...-li 

47 tgtatttactgtatggaacggaagcgttttttataaatgaaacgataaag 
100 CGATT GGTGAACG CAG CGC TTGGC C C CGAGGAG CGGGAGTGGAACTTGGC 



i..|.inmn 1 1 - 1 ( 



mi 



ii- f ■ 



97 cttattacaacagaagcgcttgaagaggaagatcgcgagtttaat gt 

15 0 TGTG TACGACTGCGAGGAAACGC CGATC GAGG C GG CG CTTGAGGAGG 

1 1 1 1 i|.|M^ij.M[-M--^MM--i«Mimii.i 

14 4 tgtgacatatgatttagaagaagcgtatttagaagatgtggttgaggatg 
197 CCGAGACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGCAT 

! ^ - * 1 1 - ^ i < 1 1 - 1 1 1 r 1 1 1 1 - 1 1 ii - - - ■ m - 1 - - 1 - 1 1 - 1 r — 

194 cacgtacacttccttttttcggagaacgtaaagtgttattaataaaatca 
24 7 CCATATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGGC 

liiiMiiM-IMllMMIMM M-I-IM1MM-M* 

244 ccattatttttaacgtcacaaaaagaaa agttagaacaaaatataaa 



2 97 GAAGCTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGATCGTCGTCT 

.M-i-U-iMiM-MIMMMI-IMM-lMMM 

2 91 aattttagaagaatatatcggggaaccttctccattttctattcttgttt 



34 7 TTTTCG CGCC GT ACGAGAAG C TTGATGAGC GAAAAAAAATTACGAAG CT C 

I r < E < M I f f < 1 1 1 1 i - i f J E 1 1 1 1 1 E i 1 3 1 1 i E 1 1 1 1 1 1 1 E - 1 f < ! I 

341 ttgttgcgccttacgaaaagcttgatgaacgaaaaaaaattacaaaact- 



3 97 GCCAAAGAGCAAAGC - GAAGTCGTCATCGCCGCCCCGCTCGCCGAAGCGG 

— I j J 1 1 ^ 1 1 1 i ||..|.||....|| IM I i - - - 1 1 - 1 

3 90 attaaagaaaacagcagatatagtagaagcgaatgcgat-gcaagtgcag 



446 AGCTGCGTGCCTGGGTGCGGCGCCGCATCGAG - AGCCAAGGGGC GC 

1-1 INI I M . . _ 1 . . | l Ml-! II-- -Mill -I 

4 39 -gat- -gttcaaaagtggattgtagc- tcgtgcagaggaagggcatgtac 



49 
46 
99 
96 
149 
143 
196 
193 
246 
243 
296 
290 
346 
340 
396 
389 
445 
438 
490 
484 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthx-acis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



B anthracis 



SEQ 177 



S anthracis 



SEQ 177 



491 AAGCAAGCGACGAGGCGATTGATGTCCTGTTGCGGCGGGCCGGGACGCAG 54 0 

I U..i!.-M-.|-l--IMi.-M-.i M M M M - 

485 atattgataatgcagctgttagtttattgttagagcttg- tgggaagtaa 533 

541 CTTTCCGCC - TTGGCGAATGAAATCGATAAATTGGCCCTGTTTGCCGGAT 58 9 

M ! I M 1 1 * I M 1 ! 1 1 - 1 1 - 1 ! - - ! - - 1 - 1 - ! I M ^ 

534 tgtgacgatgttggcgaaggaaatggacaagttaacgttatacgtcggta 583 

5 90 CGGGCGGAAC C ATC GAGGC GGC GGC GGTTGAG CGGCTTGTCGC CCG CAC G 6 39 

- i 1 1 1 f I E — I i — i - i r 1 1 1 1 1 1 1 1 1 J III — I ^ 

584 tgggcggagagattacgccgaaacttgttgcagagcttgt-gccaaaat- 631 

64 0 CCG- - GAAGAAAACGTATTTGTGCTTGT C GAGCAAGTGG CGAAGCG CGAC 6 87 

i-l li|.ili|.Millli^-|.'^lh-lllili--l!----!h 

63 2 cagttgaacaaaatgtatttgctttaacagaaaaagtggtaaaaaaagat 681 

68 8 ATTCCAGCAGCGTTGCAGACGTTTTATGATCTGCTTGAAAACAATGAAGA 73 7 

M..iii..|iMIII.|-.|l-lll'--i--M----l--MMI 

682 atcgcaggtgcgatgcaaattttggatggattatttacgcagcaggagga 731 

73 8 GCCGATCAAAATTTTGGCGTTGCTCGCCGCCCATTTCCGCTTGCTTTCGC 787 

- 1 1 1 1 1 - 1 1 1 - 1 - 1 * 1 1 1 i 1 1 - i - 1 1 1 - 1 1 i 1 1 e 1 1 e 1 1 — i 

732 accgattaaattgcttgcgttgttagtaagtcaattccgcttgctgcatc 781 

788 AAGTGAAATGGCTTGCCTC - - CTTAGGCTACGGACAGGCGCAAATTGCTG 83 5 

1 1 1 1 - 1 1 f - - 1 M-I--I I - 1 M-M-MMM-mmmmm 

782 aagtaaaagag-ttacagcagcgt-ggttatggacaaaatcaaattgct- 828 

83 6 CGGCGCTCA AGGTG - - CACCCGTTCCGCGTC AAGCTCGCTCTTGCTC 8 8 0 

-INI -MM I M M - - 1 1 ■ M ■ 1 1 1 - M I - - i - - M I 

229 tctcatattggtgtacatccatatcgtgtaaagttagcgatgaatc 874 

8 81 AAG CGG C C CG CTTCGCTGAC GGAGAGCTTG CTGAGG CGAT C AACGAGCTC 93 0 

MMI M I • M • - ■ I - M I * M I • I • 1 1 1 M - M I - ■ I - 

875 aaacgaagtttttctcttttgaagaactaaaaaaagtgattatagaattg 924 

931 GCTGACGCCGATTACGAAGTGAAAAGCGGGGCGGTCGAT - CGCCGGTTG- 978 

1 1 - 1 1 - 1 ! • i M M 1 1 1 1 ■ I ■ - M IMM -I III 

925 gcggaagctgattatagtatgaagactggaaaaatggataagaaacttgt 974 

97 9 GCCGTTGAGCTGCTTCTGATGCGCTGGGGCGCCCGCCCGGCGCAAGCGGG 102 8 

I I MMM-M-M-MMM MMM-I- 

975 gc--ttgagtttttcttaatgcgat taaatcatatgtga 1011 

102 9 GCGCCACGGCCGGCGG 1044 



B anthracis 



1012 



1011 



######################################## 

# Program: needle
# Rundate: Wed Aug 23 15:01:17 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15011448417323.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_cereus
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1076
# Identity: 588/1076 (54.6%)
# Similarity: 588/1076 (54.6%)
# Gaps: 97/1076 ( 9.0%)
# Score: 2650.0
#
#



SEQJL77 


1 


ATGC-TGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCC 

III. M--I--UI MII-MI-MIM-MMMIMIMI-- 

atgagtgatatacata aaaagattaaaaagaagcagtttgctecgt 


49 


E^cereus 


1 


46 


SEQ__177 


50 


TTTATTTATTATACGGCAATGAGCCGTTTTTATTAACGGAAACGTATGAG 

1 - 1 i 1 1 1 1 - 1 1 M ! i J - 1 f - - 1 f 1 1 1 1 1 - 1 ! 1 - 1 1 1 1 f 1 — M 

tatatttactatatggaacagaagcgttttttataaatgaaacgataaag 


99 


B_cereus 


47 


96 


SEQ_17 7 


100 


CGATTGGTGAACG CAG CGCTTGG C C C C GAGGAGCGGGAGTGGAACTTGG C 

|...|-^.|. -1-11111111 M-IMMMMMI- 1- 

cttattacaacagaagcgcttgaagaggaagatcgcgagtttaat gt 


149 


B^cereus 


97 


143 


SEQ__17 7 


150 


TGTG TAG G AC TGCG AGG AAA C G C C GAT CGAGG CGG CG C TTG AGG AGG 

MM MMMMMMMM 1 M M - - 1 - 1 - M M M 1 • 1 

tgtgacatatgatttagaagaagcatatttagaagatgtggttgaggatg 


196 


B_cereus 


144 


193 


SEQ_177 


197 


C CGAGACGGTGC CGTTTTT CGG CGAGC GG CGTGTCATTC T CAT CAAG CAT 

|....|i-MMIMIIIIIII.Ii-M.^MI--[-M-IIMI^^ 

cacgtacacttccttttttcggagaacgtaaagtgttattaataaaatca 


246 


B__cereus 


194 


243 


SEQJ77 


247 


CCATATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGGC 

iei — 1 1 1 r i i r < r i - - 1 1 1 1 r 1 1 i m - 1 - 1 11 1 m m i • i * - 

ccactctttttaacttcacaaaaagaaa agttagaacaaaatataaa 


296 


B^cereus 


244 


290 


SEQ_177 


297 


GAAGCTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGATCGTCGTCT 

. I — 1 M M M M I ■ - 1 ■ • M I • • 1 1* 1 1 - 1 1 - II 1 1 1 M M ■ 1 M 1 M 

aattttagaagaatatattggggaaccttcteccttttctattcttgttt 


346 


B^cereus 


291 


340 


SEQ_177 
B__cereus 


347 
341 


TTTT CGC GC CGTACGAGAAG CTTGATGAG CGAAAAAAAATTACGAAG CTC 

M - 1 M M ! 1 M 1 1 1 1 - 1 1 1 1 1 M II 1 1 M 1 1 1 1! 1 M M II 1 M 1 ■ II 

ttgttgcgccttacgaaaagctggatgaacgaaaaaaaattacaaaact- 


396 
389 


SEQ_17 7 


397 


GCCAAAGAGCAAAGC - GAAGTCGTCATCGC CGCCCCGCTCGCCGAAG 

~ - - 1 1 1 1 1 • < 1 - 1 1 1 1 1 - 1 1 - 1 i M • M - - ■ M - M - M * i 

attaaagaaaacagcggatgtagtagaagcgaatgcaatgcaagtgcagg 


442 


B_cereus 


390 


439 


SEQ_177 


443 


CGGAGCTG - CGTGCCTGGGTG - - CGGCGCCGCATCGAGAGCCAAGGGGCG 


489 


B cereus 


440 


..|.-M ■ M S ■ - 1 • M II Ml M-i M-MMM IM-IM 

atgttcagaagtggattgttgctcgg-gcag-atgaagtgc- -atgtgca 


485 



3EQ_177 

B_cereus 

SEQ_177 

B_cereus 

SEQ_177 

B_cereus 

SEQ_177 

B__cereus 

SEQ_177 

B__cereus 

SEQ_177 

B_cereus 

SEQ_177 

B^cereus 

SEQ__17 7 

B__cereu0 

SEQ_177 

B^cereus 

SEQ___177 

B_cereus 

SEQ__177 

B_cereus 

SEQ_177 

B cereus 



4 90 CA AGCAAGCGAGGAGGCGATTGATGTCCTGTTGCGGCGGGCCGGGAC 

III-. 



486 tattgatcatgc 



ggctgttagtttgttgttagagcttg- tgggaa 



53 7 GCAGCTTTCCGCC - TTGG CGAATGAAAT CGATAAATTGG C CC TGTTTGC C 

|.|..,|,,||-. | | | | | | | I - I I I I I * I I * 1 I - I I - - I - - I - I - I I - I 
53 0 gtaatgtaacgatgttggcgaaggaaatggacaagttaacgttatatgtc 



5 86 GGATCGGGCGGA ACCATCGAGGCGGCGGCGGTTGAGCGGCTTGTC 

IM-MMMM - - 1 1 - 1 1 1 * I 1111- — I i I i 1 1 

5 80 ggt at gggcggagatat tacaccgaaac tc-gt tgcagagc t tgt - 

6 31 GCCCGCACGCCG - - GAAGAAAACGTATTTGTGC- TTGTC - GAGCAAGTGG 

III-..I. I ■ I 1 1 1 - 1 1 1 1 - 1 1 * 1 1 ill MM M--MMM 

624 gccaaaat-cagttgaacaaaatgtgtt-- tgctttgacagaaaaagtgg 
67 7 CGAAGCGCGACATTCCAGCAGCGTTGCAGACGTTTTATGATCTGCTTGAA 

-Ml — M M I - ■ I - 1 • - M I - 1 1 1 i ■ I * - 1 1 • M It - - ■ I - ■ M - - - 

671 t-aaaaaaagatatcgcgggtgcgatgcaaattttggatggattatttacg 
727 AACAATGAAGAGCCGATCAAAATTTTGGCGTTGCTCGCCGCCCATTTCCG 

M • M M I M I M I M I M 1 1 M * M M M M • M M IIMIMI 

721 cagcaggaggaacctattaaattgcttgcgttattagtaagtcaattccg 

7 77 CTTGCTTTCGCAAGTGAAATGGCTTGCCTC - - CTTAGGCTACGGACAGGC 

M I M I M M 1 1 1 1 M M MM-M IM 1 1 M I M I M I . - - 

771 tttactgcatcaagtgaaagag-ttacagcagcgt-ggttacggtcaaaa 
825 GCAAATTGCTGCGGCGC TCAAGGTGC ACCCGTTCCGCGTCAAGCTCG 

MMM I MIMM I 1 1 I f I - 1 I - 1 1 I - 1 1 - 1 i - - 1 - 1 

819 tcaaat- ~c-gcgtctcatattggtgtgcatccatatcgtgtaaaattag 

8 72 C T C TTG CTCAAGCGG C CCGC TTC GC TGACGGAGAG C TTG CTGAGG CGATC 

I - M - • M M i M I | h - 1 1 — I - 1 1 1 - - i M M M M 

866 cgatgaatcaaacgaagtttttctcttttgaagaattaaaaaaagtgata 
922 AACGAGCTCGCTGACGCCGATTACGAAGTGAAAAGCGGG - GCGGTCGAT- 

I . M I * M M I M I M I M I II I MMM MM IMM 

916 atggaattagcggaagctgattatagtatgaaaa-cgggaaaaatggata 
97 0 CGCCGGTTG - GCCGTTGAGCTGCTTCTGATGCGCTGGGGCGCCCGCCCGG 



M — Ml I 



M M M * M ■ 



965 agaaacttgtgc--tcgagtt:tttcttaatgcggt 
1019 CGCAAGCGGGGCGCCACGGCCGGCGG 1044 



taaat 



536 
529 
585 
579 
630 
623 
676 
670 
726 
720 
776 
770 
824 
818 
871 
865 
921 
915 
969 
964 
1018 
1002 



1003 catatgtga 



1011 



######################################## 

# Program: needle
# Rundate: Wed Aug 23 15:06:55 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15065418854824.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_thuringiensis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1067
# Identity: 587/1067 (55.0%)
# Similarity: 587/1067 (55.0%)
# Gaps: 79/1067 ( 7.4%)
# Score: 2694.0
#
#



SEQ_177 1 ATGC-TGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTCTCCCC 4 9 

III- ||..|..|.|| .!IMiMMi.."|..MMIM" 

B thuringiens 1 atgagtgatatacata aaaaaattaaaaagaaacagtttgctccgt 46 



SEQ_177 50 TTTATTTATTAT ACGG CAATGAG C CGTTTTTATTAACGG AAAC GTATGAG 99 

{.ililii.|!IM|.I^M--i-illfl--Mi..iMNI....ll 

B thuringiens 47 tatatttactatatgggacggaagctttttttataaatgaaacgataaag 96 



SEQ_1 77 100 CGATTGGTGAACGCAGCGCTTGGCCCCGAGGAGCGGGAGTGGAACTTGGC 14 9 

I • • • I • • - 1 1 • - M 1 1 • 1 1 1 1 1 1 ■ 1 1 ■ 1 1 ■ 1 1 M • - 1 1 - I ■ 

B_thuringiens 97 cttattacgacagaagcacttgaagaagaagatcgagagtttaat gt 143 



S EQ_1 77 150 TGTG TACGACTG C GAGGAAACG C CGAT CGAGGC GGC G CTTGAGGAGG 196 

MM I M I Ml - M M M M I i - M - 1 - - 1 - 1 - M M M I ■ i 

B thuringiens 144 tgtgacatacgatttggaagaagcgtatttagaagatgtggttgaggatg 193 



SEQ_17 7 197 C™ CGAGACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGCA 24 5 

I II- |M-i-U.M-M-M-MMM---M--i--i-M-M... 

B__thuringiens 194 cgcgt -acgcttcctttctttggagagcggaaagtactgttaataaaatc 242 



SEQ_17 7 24 6 TCCATATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGG 29 5 

-II II I i 1 - 1 1 - II - - 1 II 1 1 M • f M * I M i - II • - 1 M - 

B thuringiens 243 accgctatttttgacttcgcaaaaagaaa agttagagcaaaatataa 2 89 



SEQ_177 296 C GAAG CTGGAGGC GTACTTGAAGG CGC C GTC GC C GTTTTC GATCGTCGT C 345 

- - 1 - - - 1 - 1 1 - 1 - M I i ■ i M M I - 1 1 M I M 1 1 M M 1 * • M M 

B_thuringiens 290 aaattttagaagaatacattggtgaaccttctccattttctattcttgtt 339 



S EQ_1 77 3 4 6 TTTTTCGCGCCGTACGAGAAGCTTGATGAGCGAAAAAAAATTACGAAGCT 3 95 

1 1 1 - 1 - 1 M M - M I M - 1 1 - M M M ! I M 1 1 M 1 1 M M II I - 1 1 M I 

B thuringiens 34 0 tttgttgcgccttacgaaaaacttgatgagcgaaaaaaaattacaaaact 38 9 



SEQ_1 77 3 96 CGCCAAAGAGCAAAGCGAAGTCGTCATCGCCGCCCCGCTCGCCGAAGC - G 444 

... Mill. .|. Ml ||..||..|.|..| M---I Ml-M ■ 

B thuringiens 390 -attaaagaaaacagc- -agatgtagtagaag cgaatg-cgatgcaa 432 



SEQ_177 445 GAGCTGCGTGCCTGGGTGCGGC- - GCCGCATCGAG - AGCCAAGGGGCGCA 491 

I * 1 1 * I * • M * * * * I * * I ■ M • I * "I I 111*1 II ■ ■ ■ I ■ I * II * * ■ 

B thuringiens 433 gtgcaggatgttcagaagtggattgttgc- tcgggcagatgaagtgcatg 481 



SEQ_17 7 


492 


B_thuri rig lens 


482 


SEQ_177 


538 


B_thuringiens 


531 


SEQ_177 


587 


B_t hu r i rig i e ns 


581 


SSQ_177 


637 


B_thuringiens 


630 


SEQ_17 7 


683 


B_thuringiens 


677 


SEQ_177 


733 


B_thuringiens 


727 


SEQ_177 


783 


B thuringiens 


777 


SEQ_177 


831 


B thuringiens 


825 


SEQ__17 7 


881 


B t hurl rig i ens 


875 


SEQ_177 


931 


B^tliuringiens 


925 


SEQ_177 


979 


B__thuringiens 


974 


SEQ_177 


1028 


B__thuringiens 


1012 



11-11 -Nil- 



* + | * ■ | | i i 



|..|..| 



Ml! 



Ill 



I- |.| ll.-IIN-lllll ill lll-l ll--llllll.il 



..||..|||...|..||....|..|. 



I UN- 1 1 1 1 1- II 



-il-ll 



MIMIIII--I II- 



I ll-lf-ll-ll----lllil 



Ml--! I 1 1 - 1 1 - 1 1 1 1 •■ 1 1 ■ 1 1 • 1 1 - ■ I - 1 1 •■ I - - • 1 1 

tgcatcacatattggtgtacatccgtatcgtgtaaaattggcaatgaatc 
881 AAGC GGC C CGCTTC G CTGAC GG AG AG CTTGCTGAGGCGATC AACGAGCTC 

1 1 - 1 i I M - M — f - M I - ! 

875 aaacgaaatttttctcttttgaagaattaaaaaaagtgattatagaatta 

:gaagtgaaaagcggg - GCGGTCGAT - CGCCGGTTG 

Mli-i !IM ■ * I • I • I f I -I---- 



ii I - i I s I I - 

gc- - tcgagtttttcttaatgcg^ 
1028 GGCGCCACGGCCGGCGG 1044 

1011 



537 
530 
586 
580 
636 
629 
682 
676 
732 
726 
782 
776 
830 
824 
880 
874 
930 
924 
978 
973 



taaatcatatgtga. 1011 



########################################

# Program: needle
# Rundate: Wed Aug 23 15:13:23 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15132265139353.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_weihenstephanensis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1074
# Identity: 579/1074 (53.9%)
# Similarity: 579/1074 (53.9%)
# Gaps: 93/1074 ( 8.7%)
# Score: 2650.5
#
#
#=======================================



SEQ_177 


1 


ATGC-TGGAACGCGTATGGGG7A7AACATTGAAAAACGGCGTTTTTCTCCCC 

Ml- ||..|..|.M-I--I 1 1 1 - 1 1 1 1 1 < ■ II * • 1 1 M • ■ 

atgagtgatatacataagaag attaaaaagaaacagttcgctccgt 


49 


B_we ihe ns t ep h 


1 


46 


SEQ__177 


50 


TTTATTTATTATACGG CAATGAG C CGTTTTTATT/AACGGAAAC GTATGAG 


99 


B_jweihensteph 


47 


1 - 1 1 1 - 1 1 • 1 - 1 1 - 1 1 - 1 - - 1 1 - 1 - 1 • 1 M 1 1 1 - - 1 1 1 1 1 1 M 

tgtatgtactgtatggaacggaagcttattttataaatgaaacgataaag 


96 


SEQ_17 7 


100 


CGATTGGTGAACGCAGCGCTTGGCCCCGAGGAGCGGGAGTGGAACTTGGC 

i^-i....|.-Miiiim imuimmmmm i- 


149 


B weihensteph 


97 


cttattacaacggaagcgcttgaagaggaagatcgagagtttaat gt 


143 


SEQJL77 


150 


TGTG TAG GACTG CGAGG7AAACG C CGATCGAGGCGG CGCTTGAGGAGG 

MM 1 1 1 1! • M M 1 - 1 1 1 - 1 1 M M ■ i - 1 - - M 1 1 1 1 1 ■ ! 

tgtgacatacgatttggaagaagcgtatttagaagatgtagttgaggatg 


196 


B__weihensteph 


144 


193 


SEQ_177 


197 


CCGAGACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTCATCAAGCAT 

MMM MMM IMI [MM IMM IM-M M I . . 1 . 1 1 . 1 1 - . . . 

cgcatacgcttcctttttttggagagcgtaaagttatattaataaaatca 


246 


B_weihensteph 


194 


243 


SEQ_17 7 


247 


CCATATTTTTTTACGTCTGAAAAAGAGAAGGAGATCGAACATGATTTGGC 

MM-MIIIIMIMMMIIMIIM 1 - M M 11 1 M M ! M ■ • • 

ccactatttttaacggcacaaaaagaaa aattagaacaaaatataaa 


296 


B__we ihensteph 


244 


290 


SEQ_177 


297 


GAAG CTGGAGG CGTAC TTG7AAGG CG C CGTCG C CGTTTTC GAT CGTCGTCT 

M ■ ■ M M 1 M - M 1 - M - M 1 M M M 1 M 1 M 1 M i M M M M 1 M 

aattttagaagaatatattggggagccatctcctttttctattcttgttt 


346 


B weihensteph 


291 


340 


SEQ_177 


347 


TTTTCGCGCCGTACGAG7AAGCTTGATGAGCG7AAAAAAAATTACG7AAGCTC 

M M M M M M M M M M M M M M M M M M M M M M 1 M M 

ttgttgcgccttatgaaaaattagacgaacgaaaaaaaattacaaaact- 


396 


B__we ihensteph 


341 


3S9 


SEQ__17 7 


397 


GCCAAAGAGC7A7AAGCG7AAGTCGTCATCGCCGCCCCGCTCGCCGAAGC - GG 

, . M 1 M 1 • M M 1 M MMiMMMM M--M IMMI M 

attaaagaaaacagcg- -gatgtaatagaag caaatg-cgatgcaag 


445 


B_we ihensteph 


390 


433 


SEQ_177 


446 


AGCTGCGTGCCTGGGTGCGGCGCCGCATCGAGAGC - C7AAGGGGCGCAAGC 

M 1 M ■ M 1 M M M 1 — M M M M M 1 MMMMIMM 

tgcaggatgttcagaagtgg-attgtttctcgtgcggatgaggtgcatg- 


494 


B weihensteph 


434 


481 



SEQ_177 495 AAGC GACGAGG CGATTGATGTCC TGTTGC GG CGGGC C GGGAC 536
.11 MM-Mi-MMMM-MMM-MMM -INI-
B_weihensteph 482 -tgcatattgataatgcagctgttagtttattgttagagcttg-tgggaa 529

SEQ_177 537 GCAGCTTTCCGCC-TTGGCGAATGAAATCGATAAATTGGCCCTGTTTGCC 585
MI — i - . ! . , . ! M M 1 1 1 - 1 1 1 1! 1 1 M * M 1 - i ^ t - S - i
B_weihensteph 530 gcaatgtaacaatgttggcgaaggaaatggacaagttaacgttatacgtc 579

SEQ_177 586 GGATCGGGCGGAACCATCGAGGCGGCGGCGGTTG-AGCGG--CTTGTCGC 632
ii---iiiiiif---ii' mim m ••mi inn -
B_weihensteph 580 ggtatgggcggagaaatt---acgccaaaacttgtcacggaacttgt--g 624

SEQ_177 633 CCGCACGCCG--GAAGAAAACGTATTTGTGCTTGTCGAGCAAGTGGCGAA 680
MMMMM 1 1 - - J I 1 1 r 1 1 - 1 1 1 1 - - - 1 1 1 - - 1 M 1 1 1 - 1 1 1
B_weihensteph 625 ccgaaatctgttgagcaaaacgtgtttgctttaacagaaaaagtggtgaa 674

SEQ_177 681 GCGCGACATTCCAGCAGCGTTGCAGACGTTTTATGATCTGCTTGAAAACA 730
— 1 1 . 1 1 - - 1 - 1 - - 1 1 1 * 1 1 1 1 - 1 1 1 r - - 1 1 f m • 1 1 — mi
B_weihensteph 675 aaaagatatcgctggtgcgatgcaaa---ttttaga-cggattatttaca 720

SEQ_177 731 ATGAAGAGCCGATC AAAATTTTGG CGTTG CT CGC CGCC CATTTC C G 776
M 1 1 II M 1 1 1 I M M M M M M M I M M M MMMM
B_weihensteph 721 cagcaggaagaaccgattaaactgctcgctttattagtaagtcagttccg 770

SEQ_177 777 CTTGCTTTCGCAAGTGAAATGGCTTGCCTC--CTTAGGCTACGGACAGGC 824
M M M M M M M I - I M M - I i-f M M I II 1 1 1 M -
B_weihensteph 771 cttgctgcatcaagtgaaagag-ttgcaacagcgt-ggttacggacaaaa 818

SEQ_177 825 GCAAATTGCTGCGGCGCTCAAGGTG--CACCCGTTCCGCGTCAAGCTCGC 872
MMMMMM MM-. MM! I M 1 1 1 1 M I M 1 1 M ! M M M
B_weihensteph 819 tcaaatcgcgtc--ccatattggtgtacatccgtaccgggtaaagttggc 866

SEQ_177 873 TCTTGCTCAAGCGGCCCGCTTCGCTGACGGAGAGCTTGCTGAGGCGATCA 922
. M — M M M M IMMM-MMIMM MMMMi
B_weihensteph 867 gatgaatcaaacgaagtttttctcttttgaagaattaaaaaaagttatta 916

SEQ_177 923 ACGAGCTCGCTGACGCCGATTACGAAGTGAAAAGCGGGGCGGTCGAT-CG 971
- M I - M M I M M I M I M M M M M - M M * M M M M M
B_weihensteph 917 tagaattggcggaagctgattatagtatgaagactggaaagatggataag 966



SEQ_177 972 CCGGTTG-GCCGTTGAGCTGCTTCTGATGCGCTGGGGCGCCCGCCCGGCG 1020
Ml 1 1 I i 1 1 M I - M - M M 1 1 1 1 M I-
B_weihensteph 967 aaacttgtgc--ttgagtttttcttaatgcggt----------taaatca 1004



SEQ__177 1021 CAAGCGGGGCGCCACGGCCGGCGG 1044 

MM-M 

B_weihensteph 1005 tatgtag 1011 



########################################
# Program: needle
# Rundate: Wed Aug 23 15:18:12 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15181247653341.output
########################################

#=======================================
#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_subtilis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1085
# Identity:    602/1085 (55.5%)
# Similarity:  602/1085 (55.5%)
# Gaps:         82/1085 ( 7.6%)
# Score: 2814.0
#
#



SEQ_177 1 ATGCTGGAACGCGTATGGGGAAACATTGAAAAACGGCGTTTTTC-TCCCC 49

MM 1 1 - 1 1 1 — 1 1 - 1 - 1 1 M t f f - 1 1 - 1 — I - 1 III-- 

B_subtilis 1 atggtatttgatgtgtggaaaagcc-tgaaaaaaggagaggtccatccgg 49

SEQ_177 50 TTTATTTATTATACGGCAATGAGCCGTTTTTATT-AACGGAAACGTATGA 98

f i ■ 



B_subtilis 50 tttattgtttatacggaaaagagacatatctgctgcaagaaaccgtcagc 99

SEQ_177 99 GCGATTGGTGA-ACGCAGCGCTTGGCCCCGAGGAGCGGGAGTGGAACTTG 147

.MIMM-M Mi ■ ■ 1 1 i i i • • I - - 1 1 1 mmmmmmi 

B_subtilis 100 aggatcagacagacg-gtcg-ttgatcaggagacaaaggatttcaatctg 147

SEQ_177 148 GCTGTGTACGA-CTG--CGAGGAAACGCC-GATCGAGGCGGCGCTTGAGG 193

Mlill-MM- Ml hi MMMMM 



• i * 



B_subtilis 148 tccgtttttgatctggaagaggacccgctggatcaa----gcgattgcag 193

SEQ_177 194 AGGCCGAGACGGTGCCGTTTTTCGGCGAGCGGCG---TGTCATTCTCATC 240

I M I M I M i M M M 1 1 1 M M M 1 1 1 1 M 1 1 MMM! - 1 - 

B_subtilis 194 atgctgaaacgtttccgtttatgggggagcggcgtcttgtcatt---gtg 240

SEQ_177 241 AAGCATCCATATTTTTTTACGTCTGAAAAAGAGAAGG---AGATCGAACA 287

I I • M M 1 1 II M 1 1 1 1 M M - 11 1 1 M * M M I M I - M - M - 1 1 

B_subtilis 241 aaaaatccatattttttaacaggtgaaaagaaaaaagaaaaaattgagca 290 

SEQ_177 288 TGATTTGGCGAAGCTGGAGGCGTACTTGAAGGCGCCGTCGCCGTTTTCGA 337

■ M - - i 1 1 1 M I * M M M M • M - M M I - M I M M - - M - - 

B_subtilis 291 caacgtcagtgcgcttgaatcatatatacaatcacctgcgccttacacag 340 

SEQ_177 338 TCGTCGTCTTTTTCGCGCCGTACGAGAAGCTTGATGAGCGAAAAAAAATT 387

M M M M M - M M I M M II M M 1 1 1 1 M M M M M 1 1 II I - M - 

B_subtilis 341 tctttgtcctgcttgctccgtatgaaaagcttgatgagcgaaaaaagctg 390

SEQ_177 388 AC GAAGCTCGC - - CAAAGAGCAAAGCGAAGTCGTCATC 423

1 1 1 1 1 1 3 I E I - Ml M ■ M M M I M I - - I - 

B_subtilis 391 acgaaagcgttaaagaagcacgcgtttatgatggaggcgaag--gaatta 438

SEQ_177 424 GCCGCCCCG--CTCGCCGAAGCGGAGCTGCGTGCCTGGGTGCGGCGCCGC 471

. . M I • - M -MM MM Ml Ml Mf M- 

B_subtilis 439 aatgcaaaggaaacgac-agactttactgtcaacct---tgcaaaaacag 484



SEQ_177 472 ATCGAGAGCCAAGGGGCGCAAGCAAGCGACGAGGCGATTGATGTCCTG-T 520

I - 1 1 1 1 ! • i I • • ! I • • I i I ll-lll I 

B_subtilis 485 agcagaaaacaatcggcac--ggaagcggcggagcatttg--gttctgct 530

SEQ_177 521 TGCGGCGGGCCGGGACGCAGCTTTC--CGCCTTGGCGAATGAAATCGATA 568

M- MIM - I J - 1 I - i I MMMMM MM-! 

B_subtilis 531 tgt gaacgg tcatctgtcatcgatttttcaggag-attcaaaa 572 

SEQ_177 569 AATTGGCCCTGTTTGCCG--GATCGGGCGGAACCATCGAGGCGGCGGCGG 616

.«|..|f.r llll — I ! f I M - 1 - J 1 1 - - - M - - J I * i - • I 

B_subtilis 573 gctctgcac-gtttattggagatcgtgaagaaattacgc--tggatgatg 619

SEQ_177 617 TTGAGCGGCTTGTCGCCCGCACGCC-GGAAGAAAACGTATTTGTGCTTGT 665

1^1 — I M I M M ^ 1 MM ■ 1 1 1 - 1 1 1 1 - • I - 1 1 1 1 ■ i M - I 

B_subtilis 620 taaaaatgcttgttgctag-aagccttgaacaaaatatttttgagctgat 668

SEQ_177 666 CGAGCAAGTGGCGAAGCGCGACATTCCAGCAGCGT-TGCAGACGTTTTAT 714

IMMIMMMMMI IN 11-11 MM-MMMM 

B_subtilis 669 caataaaatcgtcaaccgaaaacgaacag-agagtctgcaaattttttat 717 

SEQ_177 715 GATCTGCTTGAAAACAATGAAGAGCCGATCAAAATTTTGGCGTTGCTCGC 764

ilMNi-.|M.|IIMil|.|[IMf IIIMMIMM^I^i 

B_subtilis 718 gatttgctaaaacaaaatgaagaaccgatcaaaattatggcgcttatttc 767

SEQ_177 765 CGCCCATTTCCGCTTGCTTTCGCAAGTGAAATGGCTTGCCTCCTTAGGCT 814

..MMNMMMMMMMM'M.U IM 

B_subtilis 768 gaatcagttccggctgattctgcagacgaagtacttcgcggaacagggat 817 

SEQ_177 815 ACGGACAGGCGCAAATTGCTGCGGCGCTCAAGGTGCACCCGTTCCGCGTC 864 

1 1 j 1 1 1 1 — i m j i - f f i - 1 — 1 1 - 1 1 - r i - 1 1 1 1 1 - 1 1 - j i - 1 1 - 

B_subtilis 818 acggacaaaaacaaatcgcttctaatctaaaagttcacccatttcgggta 867

SEQ_177 865 AAGCTCGCTCTTGCTCAAGCGGCCCGCTTCGCTGACGGAGAGCT-TGCTG 913

I I I i M i 1 - M II II M I 1 I - ■ ! ■ I M I I I II -M- 

B_subtilis 868 aagctggcgatggatcaagcaaggcttttttcagaggaggagctccgttt 917

SEQ_177 914 AGG CGATCAAC GAG CT CGCTGACGC CGATTACGAAGTG AAAAG CGGG 960

I MM I ! 1 1 M M • f - - - 1 1 • I f • 1 1 • • 1 1 1 1 1 M M 1 1 

B_subtilis 918 aattattgaac-agcttgccgtcatggactatgagatgaaaaccgggaaa 966 

SEQ_177 961 GCGGTCGATCGCCGGTTGGCCGTTGAGCTGCTTCTGATGC-GCTGGGGCG 1009

^ t M - 1 - 1 - I ■ - - 1 1 - • I IMIM.M-M MM 

B_subtilis 967 aaggacaagcagctcctgctc---gaactgttccttttacagctgttaaa 1013

SEQ_177 1010 CCCGCCCGGCGCAAGCGGGGCGCCACGGCCGGCGG 1044



B_subtilis 1014 aagaaatgaaaaaaac-gatccccattattga 1044



########################################
# Program: needle
# Rundate: Wed Aug 23 15:23:37 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15233673591570.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_licheniformis
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1123
# Identity:    616/1123 (54.9%)
# Similarity:  616/1123 (54.9%)
# Gaps:        137/1123 (12.2%)
# Score: 2964.5
#
#



SEQ_177 1 ATGCTGG- AACG CGTATGGGGAAACAT 26
* 1 1 1 • I • -III ■ M - 1 1 1 - I i ■ i 1
B_licheniform 1 atgaaagaggggaaagtgcagatgacggtatttgatgtttggaaaagc-t 49

SEQ_177 27 TGAAAAACGGCGTTTTTC-TCCCCTTTATTTATTATACGGCAATGAGCCG 75
1 1 1 1 1 1 1 - 1 H i - - - M I 1 1 i f - 1 f 1 1 N • - M - 1 M 1 1 ^ 1 1 1 1 J - M
B_licheniform 50 tgaaaaaaggcgacattcatcccgtttattgcttgtacgggaaagagacg 99

SEQ_177 76 TTTTTATTAACGGAAACGTATGAGCGATTGGTGAACGCAGCGCTTGGCCC 125
-Ill-Ill. I-II-I---III--.-
B_licheniform 100 catcttctgcaggagacggttcaaaggatcagacaagccgttgttgatga 149

SEQ_177 126 CGAGGAGCGGGAGTGGAACTTGGCTGTGTACGA-CTGCGAGGAAACGC-- 172
.![[-.h**IM"MIII--l--U--l! MI-MI-MIM
B_licheniform 150 agagacgaaagactttaacttctcgatatttgatctggaagaagacgcgc 199

SEQ_177 173 -CGATCGAGGCGGCGCTTGAGGAGGCCGAGACGGTGCCGTTTTTCGGCGA 221
IMIMM-M I - 1 1 1 1 1 - M - 1 1 I r - - 1 - 1 f 1 1 1 1 - i - f I - i I
B_licheniform 200 tcgatcttgccg----tggaggatgcggaaacctttccgtttatggggga 245

SEQ_177 222 GCGGCG TGTC ATTCTCATCAAGCATCCATATTTTTTTACGTCTGAAA 268
..MM Mlf-I- -M.-ll-l 1 1 - — • 1 M I ! - 1 1 - - 1 f I f 1 1
B_licheniform 246 aaggcgtcttgtcgtg-gcgaaaaac--ccggcgtttttaacagctgaaa 292

SEQ_177 269 AAGAGAAGG---AGATCGAACATGAT-TTGGCGAAGCTGGAGGCGTACTT 314
MMIMII M.|.]|.|||Mi l.ihM MMI-M-MM
B_licheniform 293 aaaagaaggataagcttgagcataatctcggtgta-ctggaagcttacat 341

SEQ_177 315 GAAGGCGCCGTCGCCGTTTTCGATCGTCGTCTTTTTCGCGCCGTACGAGA 364
* I . I - i 1 1 1 - 1 - 1 1 ■ ! - M I - I • ■ I ■ H ■ M < • 1 - 1 i 1 1 1 - 1 1 - II - !
B_licheniform 342 tacgcagccggcaccttattccgtgtttgttttgcttgcgccttatgaaa 391

SEQ_177 365 AG CTTGATGAG CGAAAAAAAATTACGAAGC T CG C CAAAGAG CA 407
IIIMIIM-IMMII--MI-MI1M--. Ill-Mil
B_licheniform 392 agctcgatgaacggaaaaagctgaccaagctcctgaaaaagcatgcgcac 441

SEQ_177 408 AAGCGAAGTCGTCATCGCCGCC CCGCTCGCCGAAGC 443
IMIMII I-..M-IIIM I - 1 1 1 — I - - - 1 - *
B_licheniform 442 atggtggaagcgaag--gaattaaccgccaaagaaacggctgactttatc 489

SEQ_177 444 GGAGCTGCGTGCCTGGGTGCGGCGCCGCATCGAGAGCCAAGGGGCGCAAG 493
IM-MI! Ml M 1 ^ - ! f ■ i M I-J.
B_licheniform 490 gcaggcct cgcgaagtcggaaggcaaacaaat cgaa 525

SEQ_177 494 C---AAGCGAC-GAGGCGATTGATGTCCTGTTGCGGCGGGCCGGGACGCA 539
i lilM- 1 IIIM.I MUMMM-I-I-MMM
B_licheniform 526 ccggaagcggcggaggagct---tgtcctgttatgtcatgcaag------ 566

SEQ_177 540 G C TTTC C G C CTTGGCGAATGAAAT CGATAAATTGGC CCTG TTTGCC 585
-IMM-iMMMMMMi UMI---IU-I
B_licheniform 567 tctgtcagccattgcccagga----ggttaaaaagctcagtacatatacc 612

SEQ_177 586 GGATCGGGCGGAACCATCGAGGCGGCGGCGGTTGAGCGGCTTGTCGCCCG 635
II. --..Mill... Ill IMM-M IMMI-II-M
B_licheniform 613 ggtgaccgcggagagatcaccatagaggatgtcagaaagcttgtagcgag 662

SEQ_177 636 CACGCCGGAAGAAAACGTATTTGTGCTTGTCGAGCAAGTGGCGAAGCGCG 685
...M-MilMMMMMU.M[.-MM--i-M-t.-llMM*
B_licheniform 663 agggcttgaacaaaatattttcgaactgatcaataaggtcgtcaatcgga 712

SEQ_177 686 ACATTCCAGCAGCGTTGCAGACGTTTTATGATCTGCTTGAAAACAATGAA 735
I — I ^ 1 S 1 1 1 1 1 - 1 1 1 1 1 1 - > 1 1 < ) i - 1 i i 1 1 ^ 1 - I ^ E - E E E 1 1 !
B_licheniform 713 aacgtacagaagcgctgcagattttctacgatctattaaagcaaaatgaa 762

SEQ_177 736 GAGCCGATCAAAATTTTGGCGTTGCTCGCCGCCCATTTCCGCTTGCTTTC 785
i i f 1 1 1 J I - i 1 1 1 1 * 1 1 i 1 1 * I i - J - M - - - M i J f - 1 - - 1 1 i I —
B_licheniform 763 gagccgattaaaatgatggcgctgattgctaatcagttcaggctgctgat 812

SEQ_177 786 GCAAGTGAAATGGCTTGCCTCCTTAGGCTACGGACAGGCGCAAATTGCTG 835
1 1 1 - - 1 1 1 1 — II! — I • - 1 1 1 1 1 ! M 1 1 — I M M 1 1 1 -
B_licheniform 813 gcagaccaaatatttttctgaccagggatacggacaaaagcaaat-cgcct 862

SEQ_177 836 CGGCGCTCAAGGTGCACCCGTTCCGCGTCAAGCTCGCTCTTGCTCAAGCG 885
I II M S ■ M M M I I M I I M M I M I I I I M M M M * M I ■ M !
B_licheniform 863 caaaccttaaagtgcacccgttcagggtgaagctggcgatcgagcaggcg 912

SEQ_177 886 GCCCGCTTCGCTGA-CGGAGAGCTTGCTGAGGCG-ATCAACGAGCTCGCT 933

B_licheniform 913 aggcttttttcagagcag-gagctt----aggaacattatcgagc-agct 956

SEQ_177 934 GACGCC GATTACGAAGTGAA AAG C GGGGC GGT CGAT CGC C G 974
111
B_licheniform 957 --cgccgtgatggattatgaaatgaagacaggcaaaaaagacaa--gcag 1002

SEQ_177 975 GTTGGCCG--TTGAGCTGCTTCTGATG-CGCTGGGGCGCCCGCCCGGCGC 1021
-I ||- II li-l M-M-M.il .||||.-ll--ll.|--l-.--
B_licheniform 1003 ct---cctgcttgaactgtttttgctgaagctgctgcagcctcatgaaaa 1049

SEQ_177 1022 AAGCGGGGCGCCACGGCCGGCGG 1044
MM MM-II
B_licheniform 1050 aaac-ggcctcctttaa 1065



########################################
# Program: needle
# Rundate: Wed Aug 23 15:29:49 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15294971407139.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_halodurans
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1096
# Identity:    571/1096 (52.1%)
# Similarity:  571/1096 (52.1%)
# Gaps:        119/1096 (10.9%)
# Score: 2648.0
#
#



SEQ_177 1 ATG--CTGGAACGCGTATGGGGAAACATTGAAAAACGG--CGTTTTTCTC 46



B_halodurans 1 atgaactatttaaaat-taaagcaagatgtccaagcgggacgagtggccc 49



SEQ_177 47 CCCTTTATTTATTATACGGCAATGAGCCGTTTTTATT-AACGGAAACGTA 95

I ** I - 1 1 1 1 1 - • I ■ 1 1 - 1 1 - 1 ■ • I i i • - M 



B_halodurans 50 ctgtatattttatgtatgggaccgagttgtttttaatggaagacctcata 99



SEQ_177 96 -TGAGCGATTGGTGAACGCAGCGCTTGGCCCCGAGGAGCGGGAGTGGAAC 144

- 1 ■ - I • • I I I I ■ ■ I M ■ I I i I - I I I 1 ■ - ■ I 



B_halodurans 100 caggacattt--tgtccgtaacg-ttatc-----ggatgaggaaagggat 141



SEQ_177 145 TTGGCTGTGTACGACTGC-------GAGGAAACGCCGATCGAGGCGGCGC 187

- 1 1 - - f 1 1 U-.M-. --ii--iii«iii[iJiiii! 

B_halodurans 142 atgaatgt-ttcatcttattcccttactgatgttccgattgaggcggcgc 190



SEQ_177 188 TTGAGGAGGCCGAGACGGTGCCGTTTTTCGGCGAGCGGCGTGTCATTCTC 237

l-li-M.!l.i|.||ll|.||.!ll(i^Ml^h-ll|.|IM^M 

B_halodurans 191 tcgaagaagcagaaacggttcctttttttggctcgaagcgggtcgtgatc 240



SEQ_177 238 ATCAA G CATC CATATTTTTTTAC GTCTGAAAAAGAGAAGGAGAT CG 283

-l-li E i - < 1 1 a 1 1 1 1 -I ii.m...||..[.[ 

B_halodurans 241 ttaaaagatgctgccctattt acaagccaaaagcttgatgttg 283



SEQ_177 284 AACATGATTTGGCGAAGCTGGAGGCGTAC TTGAA GGCGCCGTCG 327 

UI-I1MI ! 1 1 i i — 111 Mill - i - M — I 

B_halodurans 284 agcacgatgtgaaacgattggagcaatacatattgaacccagttccagag 333



SEQ_177 328 CCGTTTTCGATCGTCGTCTTTTTCGCGCCGTACGAGAAGCTTGATGAGCG 377

■ M-MM-I--M I - J I - 1 1 - II I f • J MM-MIM-li 

B_halodurans 334 acggttttgctgatc------atggccccttatgaaaagctcgatgaacg 377



SEQ_177 378 AAAAAAAATTACGAAGCTCGCCAAAGAGCA^ 426

IIIIIMIi-miUM---IIM 1 1 1 I - 1 - - f Ml-M- 

B_halodurans 378 aaaaaaaatcacgaagcttatcaaa----aaagagtcgcttgttcttgag 423



SEQ_177 427 GCC---CCGCTCGCCGAAGCGGAGCTGCGTGCCTGGGTG-CGGCGCCGCA 472

II..M- Mill-. Ill II.IM.|- I-.I- 



B_halodurans 424 gcaaagcctttggacgaaggagaggcaaaagcatggctatcctcactggc 473



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



SEQ 177 



B halodurans 



473 TCGAGAGC- -CAAGGGGCG CAAGCAAGCGA- CGAGGCGATTGATG 

..Mill I ! 3 1 1 - 1 M M I M » I I i I M I - 

4 74 aagcgagcttcaggtggagatggacgagaaagctatcgagacgt 

515 TCCTGTTGCGGCGGGCCGGGACGCAGCTT - T C CO C CTTGG CGAATGAAAT 

MM M t ■ * M M N I M - M M I M I - - M I M I • - • I M M I 

518 tgt t - agggatgacgggtcttcgcttgacccaat tagcatctgaaat 

564 C G ATAAATTGG C C CTGTTTG C CGGATC GGG CGGAACCAT CGAGG CGG CGG 

* M M M M M I - M M M I ■ M i M M 1 M M M I 1 M M I 

564 gaataaattagcgttgtatgtaggggaaggaggcatcatccgatcagagg 
614 CGGTTGAGGGGCTTGTCGCCCGCACGCCGGAAGAAAACGTATTTGTGCTT 

♦ Ml I M - 1 1 • M M M - U • M M M M M M I - M I ■ 

614 acgtaaccttgctcgttgctaaaacgctcgatcaaaacatatttgatctg 
6 64 GT CGAGCAAGTGG CGAAG CG CGAC ATTC CAGC - AGCGTTGCA 

MM l-MMi i-MIMI MM M M M I * 

664 attg attttgc catcaatcagaggactcatcaagcgct-ct 

705 GACGTTT - TATGAT CTGCTTGAAAACAATGAAGAG C CGATCAAAATTTTG 

■ M M i - M M I M I M I • M I M M I M i M M M * M M M M M M 

704 ctctttaccatgagctcctgaaacaaaaagaggagccattgaagctgttg 
754 GCGTTGCTCGC - CGCCCATTTCCGCTTGCTTTCGCAAGTGAAATGGCTTG 

MMi-M-M MM- MM M I- M M M ^ M M I M M ■ M M i I 

754 gccttattaacacggcagttt-cgcatcatgtaccaagtcaaagagcttg 
8 03 CCTCCTTAGGCTACGGAC AGGCGCAAATTGCTGCGGCGCTCAAGGTGCAC 

M i M I M - M M - M I M Ml i ■ - M M I M M M M M I - 

803 gacgaagaggatacaccccgaatcagatggcgaagccgttaaaaattcat 
8 53 CCGTTCCGCGTCAAGCTCGCTCTTGCTCAAGCGGGCCGCTTCGCTG 

MM i M M M I M I M M M M M I M - M M 

8 53 ccctatgttgccaaacttgcgggaaaaaaagcagccagcatgtcagacca 



SEQ 177 



B halodurans 



SEQ 17 7 



B halodurans 



8 99 AC GGAGAGC TTGC TGAGGC GAT CAACGAGC TCG CTGACG C CGATTACGAA 

N M • M - M I M - M * M i M - M M M M M M M I M * M - 

903 actgttatattccctaatcg aaaaagct -gcggatacagagtttgcg 

94 9 GTGAAAAGCGGGGCGGTCGATCGCCGGTTGGCCGTTGAGCTGCTTCTGAT 

MMMMMM.MMMM IMMMMMMMIMIM 

94 9 attaaaagtggaaaggttgataaagtgctggcattagaattgttttt 



SEQ 177 



B halodurans 



999 G CG CTGGGG CG C C CG C CCGG CGCAAGCGGGG CG C CAC GGCCGG CGG 1 0 

. I . M ■ . • - 1 1 ■ ■ M I 1 1 M M I ■ ■ M I ♦ • M 

996 actcacaatgggttctag-ggagagcgtaggttag 10 



########################################
# Program: needle
# Rundate: Wed Aug 23 15:33:21 2006
# Align_format: srspair
# Report_file: /ebi/extserv/old-work/needle-20060823-15332034285052.output
########################################

#
# Aligned_sequences: 2
# 1: SEQ_177
# 2: B_clausii
# Matrix: EBLOSUM62
# Gap_penalty: 10.0
# Extend_penalty: 0.5
#
# Length: 1124
# Identity:    579/1124 (51.5%)
# Similarity:  579/1124 (51.5%)
# Gaps:        181/1124 (16.1%)
# Score: 2671.5
#
#



SEQ__17 7 1 ATGCTGGAACGCGTATGGGGAAACATTGAAAMCG-GCGTTTTTCTCC 4 7 

..|i...t.--IM...M...|lll...i--l | 

B_clausii 1 atgagcatcattacggaaatgaagcaattgtcgagtgagacaatggcacc 50

SEQ_177 48 CCTTTATTTATTATACGGCAATGAGCCGTTTTTATTAACGGAAACGTATG 97

I - I M I M I M H • I M M - I I -1111-11 



B_clausii 51 ggtatactttttgtatgggacaga- acgttatt- taatcgagcaattttt 98 

SEQ_177 98 AG C GATTGGTGAACG CAGCG CTTGG C C C C GAGGA GCGGGAGTG- - 140 

t i - 1 • I • .1 - 1 1 ■ I i i i MMMI II l-MM 

B__clausii 99 a- caaaaagcaagcgaagcattatttgcaggggaagaaggc-gaaatgaa 146 

SEQ 177 141 - GAACTTGG CTGTGTACGAC TGC GAGGAAACGC CGAT CGAGG CGGCG CTT 189 

IM-IMIII mimmm Mii-iiniMMMiM-MMM 

B_clausii 147 tgatattcgct---tacgcttac-aagatacgccgctaggggaagtggtg 192

SEQ 177 190 GAGGAGG C C GAGACGGTGC CGTTTTT CGGCGAG C GG CGTGTCATTCT 23 6 

MM MM.- 1- -I 



* - * 



B_clausii 193 gaggaagcggagactgttccgtttttctcaaagcggaaacttgttgtgat 242 

SEQ 177 237 CATCAAGCATCCATATTTTTTTACGT ~ - CTGAAA - AAGAGAAGGAGATCG 283 

| ||||.. | ^ M JI i III- Mill M - I I - - - I - i - I 

B_clausii 243 c~-caagact- tttatttggcaacgagccagaaagtagcgacaaaactgg 2 89 

SEQ 177 284 AACATGATTTGGCGAAGCTGGAGGCGTA- CTTGAAGGCGCCGTCGC - CGT 331 

1 . 1 1 f I f - - 1 ^ f 1 1 1 1 1 - M ■ i i IMIM -MM- -Ml M- 

B_clausii 2 90 agcatgacattcccgcactggaagcctatctcgac™ tcgcctgcacaaga 33 8 

SEQ 17 7 33 2 TTT CGAT CGTCGT CTTTTTCG CG C CGTACGAGAAG CTTGATGAG CGAAAA 3 81 

. . . i M ■ I M M M I M 1 1! 1 1 i I M M 1 1 M M - M M M I - 1 M I M 

B_clausii 33 9 aaccgtgc- ttgtcgttgtcgcgccgtatgaaaaattagatgaacgcaag 3 87 

SEQ_17 7 382 AAAATTACGAAG CT CGC CAAAGAG CAAAGCGAAGTCGTCATCGC C G C 42 8 

MIIMIMM Ill-I-MM MM IMMMM-i - 

B_clausii 388 aaaatcaccaaaaaattaaaaaaacaaa-cgag- -cgctattgatgtgaa 434 

SEQ_17 7 429 CCCGCTCGCCGAAGCGGAGC TGCGTGCCTGGGTGCGGCGCCGCATC 4 74 

MM-MM-MM-MM- 1 1 1 1 1 1 ■ ■ M I - ■ 1 1 1 — M * M 1 

B clausii 435 tccatt tgatgaaaaggaaacatttgcgtggatggaggcgatagctaaac 484 



SEQ_1 7 7 4 7 5 - - GAG AG C CAAGGGGC G - - C AAGCAAG CGACGAGG CGATTGATGT C CTGT 5 20 

llf-f I Mill I - 1 r 1 1 1 1 - - i 1 1 I E - 1 1 1 1 - 1 1 1 - - 

B_clausii 485 aggagggc gttgcgatcgagcaagacgcgaagcaattgctgtat - -a 529 

SEQ__177 521 TGCG-GCGGGCCGGGAC- -GCAGCT-TTCCGCC TTGGCGAA-TGA 560 

-Ml -1*1111 — II Ml-I II-1MI 1 1 1 f i 1 1 II. 

B_clausii 530 agcgtacaggccctaacttgatgttgttgcaccgtgaattggagaagtgc 579

SSQ_177 561 A ATCGATAAATTGG CCCTGT- -TTGCCGGATCGGGCG- -GA 597 

I M - f - M — II 1 1 1 - 1 1 MMMM-M t I 

B_clausii 58 0 atgttgtatagcaaagacggggacccagtcacagcaagcattgtcgacga 62 9 

SEQ_17 7 59 8 ACCATCGAGGCGGCGGCGGTTGAGCGGC TTGTCGCCCG 6 35 

il fll-.ii-M-MMt.|.- N-l-i-i" 

B_clausii 630 ac - - tcgttgctgagacagtggaacaaagtgtatttactgttattgacta 677 

SEQ_177 6 36 CACG C CGGAAGAAAACGTATTTGTGCTT GTC GAGCAAGTGGCGAAGCG CG 6 85 

MM-M-IM li M-1MMMMMII 

B__clausii 678 cacggcgaaag gc cgtgctggtgacgcggtccg 710 

SEQ_177 686 ACATT - CCAGCAGC - GTTGCAGACGTTTTATGATCTGCTTGAAAACAATG 733 

MM Ill-lilt MM ll-l MUM 

B_clausii 711 gctttaccaccagctgttg-aggc aaaaag 739 

SEQ_177 734 AAGAGCCGATCAAAATTTTGGCGTTGCTCGCCGC--CCATTTCCGCTTGC 781

. I .... 1 1 .. 1 1 1 1 1 1 1 1 1*1-111 M M M ■ M — 



B_clausii 740 aagagccgctggccatactggcgttg-ttgacgcggcaatttcggcaatt 788
SEQ__177 782 TTTCGCAAGTGAAA TGGCTTGC C TC C TTAGGCTACGGACAGG C GC AA 828 

MM 1 1 - 1 1 1 1 1 1 -MM. I IMMMIM-M I * 

B_clausii 789 tttc-caggtgaaaacgaggctag---ccaaagggtacacgcaaaaagag 834

SEQ__177 829 ATTGCTGCGGCGCTCAAGGTGCACCCGTTCCGCGTCAAGCTCGCTCTTGC 878 

I M I M ■ [ - - - ( II - M - - II I I - M M II M II II M I Mill 

B__clausii 835 atggcatcacagcttaaattgcatccttatgtagtaaagctcgc-catgc 883 
SEQ17 7 87 9 TCAAGC GGC CCGCTTCGCTGACGGAGAGC TTG C TGAGGCGATC 921 

Ml i I! - 1 - 1 1 1 1 - M 1 1 | - 1 f - 1 - 1 * I 

B_clausii 884 agcaagtcaaagcgttaac gaccaagcaattaagacaagctctt 927

SEQ_177 922 AACGAGCTCGCTGACGCCGATTACGAAGTGAAAAGCGGGGC-GGTCGATC 970

|-lf — .fj.-ii — 1*11*11-1 — l-ll-l Mill IMIIM 

B_clausii 928 atcgtttgcgaagaaactgactatgccattaaga-cagggcaggccgata 976

SEQ_177 971 GCCGGTTGGCCGTTGAGCTGCTTCTGATGCGCTGGGGCGCCCGCCCGGCG 102 0 

I M M 1 1 M • 1 1 M 1 1 - 1 - 1 1 1 • II M M M I M M M M - 

B_clausii 977 aagagttggccgtcgagctgttgctgctgcgattg---gcaagcacgtaa 1023

SEQ_177 1021 CAAGCGGGGCGCCACGGCCGGCGG 1044 

B_clausii 1024 ------------------------ 1023
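
The identity and gap percentages printed in the four needle report headers above (B_subtilis, B_licheniformis, B_halodurans, B_clausii) follow directly from the reported counts. The short Python sketch below is not part of the EMBOSS output; the names REPORTS, percent, and summarize are illustrative only, and the counts are simply copied from the "Identity", "Gaps", and "Length" lines of those headers.

```python
# Illustrative check only: counts copied from the needle report headers
# as (identical columns, gap columns, alignment length).
REPORTS = {
    "B_subtilis": (602, 82, 1085),
    "B_licheniformis": (616, 137, 1123),
    "B_halodurans": (571, 119, 1096),
    "B_clausii": (579, 181, 1124),
}


def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to one decimal place."""
    return round(100.0 * count / length, 1)


def summarize(reports):
    """Map each sequence name to (percent identity, percent gaps)."""
    return {
        name: (percent(identical, length), percent(gaps, length))
        for name, (identical, gaps, length) in reports.items()
    }


if __name__ == "__main__":
    for name, (identity, gaps) in summarize(REPORTS).items():
        print(f"{name}: identity {identity}%, gaps {gaps}%")
```

Because the alignment length in the denominator includes gap columns, a report with more gaps shows a lower percent identity for a comparable number of identical columns.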



Exhibit 3 



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: G_kaustophilus 348 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 99
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-05445443.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 7434
Alignment Score 2093
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-05445443.aln]



SEQ_178         MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60

G_kaustophilus  MLERVWGNIEKRRFSLLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPVE 60

k * * * * k * k k * k -k k k k 1c k k -k k k * * Ttr -k ik -k k * it k k -k k k k -k -k k rk -k * * * -k k * k k k * * * * * * tJc . * 

SEQ_178         AALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAPY 120

G_kaustophilus  AALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAPY 120

kkrkkkkkkkkkkk:Tk-kkkkrkk~kk-kkkk-kkk-kk1<-kkkkkkkkkkkkkkrkkk*k*:k~kkk-kk~kk 

SEQ_178         EKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180

G_kaustophilus  EKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180

^if^^^^^iiyc^^^^^klrkklckkkkl^kkkkklck^k^kkkk^kkkkkkkikkkkkkkkkkkrkk^ 

SEQ_17 8 LSALANE I DKLALFAG S GGT I EAAAVE RLVART PEENV FVL VEQVAKRD I PAALQT F Y DL 2 40 

G_kaustophiluS LS ALAME I DKLALFAG S GGT I EAAAVE RLVART PEENV FVL VEQVAKRD I PAALQT FYDL 2 40 

■kkkkkkkkk. kkk:kkkrk1rkk1fkkkk*k*kk*kkkkkkkkkr + krkk******k*k'kk-kkk-kk1<r 

SEQ_178         LENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFAD 300

G_kaustophilus  LENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFAD 300

SEQ_178         GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348

G_kaustophilus  GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGTRPAQAGRHGRR 348

k-kkkrkk-kkkkkkr-kk:kkk-kik-k*'±k*k*kkk-k-kkk-k-kk , *k*kkkkkk-k-k 
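
The ClustalW pairwise "Score" is generally described as a percent identity over the aligned positions. As a hedged illustration only (this is not ClustalW code, and the function name identity is hypothetical), the Python sketch below counts identical columns in the final block of the SEQ_178 / G_kaustophilus alignment shown above, residues 301 to 348, skipping any column that contains a gap character.

```python
from typing import Tuple


def identity(a: str, b: str) -> Tuple[int, int]:
    """Count identical columns between two aligned rows.

    Returns (identical columns, compared columns). Columns in which either
    row holds a gap character '-' are skipped. This is an illustrative
    sketch, not the ClustalW scoring routine.
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    matches = sum(1 for x, y in compared if x == y)
    return matches, len(compared)


# Final alignment block (residues 301-348) as printed in the exhibit.
SEQ_178_TAIL = "GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR"
G_KAUST_TAIL = "GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGTRPAQAGRHGRR"

if __name__ == "__main__":
    same, total = identity(SEQ_178_TAIL, G_KAUST_TAIL)
    print(f"{same} of {total} positions identical in this block")
```

The single differing column in this block is the A/T substitution at residue 337; the figure covers only this block, not the full 348-residue alignment.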



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: Bacillus_sp_NRRL_B14911 346 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 51
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-05514556.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5730
Alignment Score 1092
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-05514556.aln]



SEQ_17 8 

Bacillus_sp_NRRL_Bl 4 911 



MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLA 50 
MVLDVWKQINNKQFTPIYLLYGTEGFLINETKQLLINNVLAEDESDFNLS 50 



k k 



-k 



SEQ_17 8 

Bacillus_sp_NRRL_B14 911 



VYDCEETPIEAALEEAETVPFFGERRVILIKHPYFFTSEKEK-EIEHDLA 99 
TYDLEETPIEAALEDAETFPFMGDKRLIFLHNPSFLTSEKTKGKTEHNLA 100 

* * * -*:-k-k-k-kkkkck + -k*k -k-k . -k - A- * * * * - 



SEQ_178 

Bacillus sp NRRL B14 911 



KLEAYLKAPSPFSI VVFFAPYEKLDERKKITKLAKEQSEVVIAAPLAEAE 14 9 
KLESYLKEPSPYSIIVFAAPYEKLDDRKKITKELKKKASVLEAKKLGEQE 150 



k * k k k k 



SEQ_17 8 

Bacillus sp NRRL_B14 911 



LRAWVRRRIESQGAQASDEAIDVLLRRAGTQLSALANEIDKLALFAGSGG 199 
LKVWIRERAAQNGVQIEEDAVQLMLTLAGTNLFMLTGEIDKLALYADEEK 2 00 



* * * . * 



k k k k k k k: . * 



SEQ17 8 

Bacillus sp NRRL B14911 



T I EAAAVERLVART PEENV FVL VEQVAKRD I PAALQT F Y DLL EN N EE P I K 249 
LIDAEAVDRLVARSLEQN1 FTLVDKVVHRKI DEALRI YYDLLKQNEEPiK 25 0 



SEQ_17 8 

Bacillus sp NRRL B14911 



ILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFA 29 9 
ILSILTGQFRLIYQVKELSRRGYGQQQTAGYLECVHPFRVKLAAGQAQLFG 300 

* ile * * . -kk k k * ±**kk k k +**kkkkk*kk * * * 4 



SEQ_17 8 

Bacillus sp NRRL B14911 



DGELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 34 8 
DEELSRIMKLLAEADYEMKTGGMNKEMLIEMFLFRLHDHALHKQTK 34 6 



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: B_cereus_cytotoxis 336 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 46
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-06030715.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5437
Alignment Score 961
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-06030715.aln]



SEQ__178 

B cereus cytotoxis 



MLERVWGNIEKRRFSFLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60 
-MSDIHKKIKKKQFAPFYLLYGTEAYFINETIDL1TVEALAEEDRDFNIVTYDLEEAYLE 5 9 



■ * Tfc- * + * * * * * 



■k * 



* * 



* * * 



* * * * 



SEQ_17 8 

B cereus cytotoxis 



AALEEAETVPFFGERRV1LIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAPY 12 0 
DVVEDARTLPFFGERKILLIKSPLFLTSQKEK-LEQNIKILEEYIAEPSPFSIMIFIAPY 118 

* * «■ * ************** -k ******** « * * . . * ~k * . 



******* ****** 



SEQ_17 8 

B cereus cytotoxis 



EKLDERKKITKLAKEQSEWIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180 
EKLDERKKITKLLKKTAEWEANAMQVQDVRKWIVKRAEEVHVHVEEAAVSLLLELVGSN 178 



************ * 



* * * * 



* * 



, * * 



* * 



SEQ_17 8 

B cereus cytotoxis 



LSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYDL 24 0 
VTMLAKEMDKLTLYVGTGGDITTKLVSELVPKSVEQNVFALTEKVVKKDIAGAMKILDGL 2 38 



************ * m * * * 



* * 



* . * * * * * * * * , * * * - * 



SEQ_17 8 

B cereus cytotoxis 



LENNEEPI KI LALLAAH FRLLS Q VKWLAS LG Y GQ AQ I AAAL KVH P FRVKLALAQAARFAD 300 
FTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKLFSF 298 



* * * * * , * * ** 



* * * * * * * * 



* * * * * * * 



+ ** + ■****■** * . 



SEQ_17 8 

E cereus cytotoxis 



GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 3 4 8 
EELKKV1 FELAEADYSMKTGKMDKKLVLEFFLMRLNHM 336 



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: B_anthracis 336 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 47
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-14525436.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5427
Alignment Score 958
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-14525436.aln]



SEQ_178 

B anthracis 



MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60 
-MSDIHKKIKKKQFAPLYLLYGTEAFFINET1KL1TTEALEEEDREFNVVTYDLEEAYLE 5 9 



■*kr*krk'k-k-kkk k k 



k k 



k k km -k km * 



■k k k k 



SEQ_17 8 

B anthracis 



AALEEAETVPFFGERRVILIKHPYFFTSEKEKE1EHDLAKLEAYLKAPSPFS IVVFFAPY 12 0 
DVVEDARTLPFFGERKVLLIKSPLFLTSQKEK-LEQHIKILEEYIGEPSPFSILVFVAPY 118 

. -k * k k*kkkkkk.k + kkk k k - k k . * -k k . * .. . . kk * « - ** *** 



SEQ_17 8 

B anthracis 



EKLDERKKITKLAKEQSEVV1AAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180 
E KL DE RKK I T KLLKKT AD X VEANAMQ VQ D VQKWI VARAEEGR VH I DNAAVS LLLEL VGS N 17 8 



* *k k -k -k k "k -k * ~k k kr k 



* k k. 



k k 



k k 



SEQ_i7 8 

B anthracis 



LSALANE I DKLALFAGSGGT I EAAAVERLVARTPEENVFVL VEQVAKRD I PAALQT FYDL 24 0 
VTMLAKEMDKLTLYVGMGGEITPKLVAELVPKSVEQNVFALTEKVVKKDIAGAMQILDGL 238 



kk-k + kkk*k* k k k k 



k k .« k . k k k k k . k k + k k k • k 



SEQ__17S 

B anthracis 



LENNEE P I KI LALLAAH FRLLSQVKWLAS LG Y'GQAQ I AAALKVH P FRVKL ALAQAARFAD 300 
FTQQE E P I KLL ALL VSQ FRL LHQVKELQQRG YGQNQ IAS H I G VH P YRVKLAMNQT KFF S F 2 98 



+ *kkkkk.kkkk , - k k k k k k k k 



k k k k k k k - - kkk.kkkkk 



SEQ_17 8 

B anthracis 



G E L AE AI NE L AD AD Y E VKS G A V D RRL A VE LL LMRWG ARP AQAG RH G RR 34 8 
EELKKV1 JELAEADYSMKTGKMDKKLVLEFFLMRLNHM 33 6 



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: B_cereus 336 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 47
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-14582093.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5422
Alignment Score 953
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-14582093.aln]



SEQ_178   MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60

B_cereus  -MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDREFNVVTYDLEEAYLE 59

. * . ■ * * * r ».«.» , - - » . * • • » 

SEQ_178   AALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAPY 120

B_cereus  DVVEDARTLPFFGERKVLLIKSPLFLTSQKEK-LEQNIKILEEYIGEPSPFSILVFVAPY 118

SEQ_178   EKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180

B_cereus  EKLDERKKITKLLKKTADWEANAMQVQDVQKWIVARADEVHVHIDHAAVSLLLELVGSN 178

SEQ_178   LSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYDL 240

B_cereus  VTMLAKEMDKLTLYVGMGGDITPKLVAELVPKSVEQNVFALTEKVVKKDIAGAMQILDGL 238



* » ■■ 



SEQ_178   LENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFAD 300

B_cereus  FTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKFFSF 298

SEQ_178   GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348

B_cereus  EELKKVIMELAEADYSMKTGKMDKKLVLEFFLMRLNHM 336



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: B_thuringiensis 336 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 47
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15064397.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5424
Alignment Score 955
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15064397.aln]



SEQ_17 8 

B thuringiensis 



MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60 
-MSDlHKKIKKKQB'APLYLLYGTEAFFINETIKLITTEALEEEDREFNVVTYDLEEAYliE 5 9 



■k ~k k. - -k k * -k 



-k k * St 



SEQ__17 8 

B thuringiensis 



AALEEAETVPFFGERRVILIKHPYFFTSEKEKETEHDLAKLEAYLKAPSPFSIVVFFAFY 12 0 
DVVE DART LP FFGERKVLL I KS P L FL TS QKE K-L EQN I KI LEE Y I G E PS P FS I L V FVA P Y 118 



- k » -kw-k-k-k-k-k-k k -k^-k-km-k-k-k 



-k •k ~k 



k-kkk-kk-k* k k k 



SEQ_17 8 

B thuringiensis 



EKLDERKKI TKLAKEQS E WI AAP LAEAE LRAWVRRRI ESQGAQASDEAI DVLLRRAGTQ 180 
EKLDERKKITKLLKKTADWEANAMQVQDVQKWIVARADEVHVHIDNAAVSLLLELVGSN 17 8 



k -k * rk -k -k -k k k k k k k * 



•k -k ± 



■k * 



SEQ_17 8 

B thuringiensis 



LS ALAN E I D KLAL FAG S GG T I E AAAVE RL VART PEE N V F VL VEQ VAKRD I P AALQT FY D L 24 0 
VT MLAKEMDKLTL Y VGMGGE I TT KL VAEL VPKS VEQNVFALTEKVVKKD I AGAMQ I LDGL 2 38 



* -k-k.-Jt.-k-jf-ki-k-r -k k k k: 



■k k 



k * k -k k k k • k k * -k k * * * 



SEQ_178 

B thuringiensis 



L E NNE E P I K I LALLAAH FRLLSQ VKWL AS L G YGQAQ I AAALKVH P FR VKLAL AQ AAR FAD 300 
FTQQEEPIKLLALLVSQFRLLYQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKFFSF 2 98 



+ -kk-k-kk,kkkk * . *r * * + -k -k -k * 



-k k * k k k kr 



-k-kkr^kkkkk* k 



S£Q_178 

B thuringiensis 



GE LAE A 1 N E LAD AD Y E VKS G AV D RRL AVE L L LMRWGARP AQAG RHGRR 34 8 
EELKKVI 1ELAEADYSMKTGKMDKKLVLEFFLMRLNHM 33 6 



CLUSTAL W (1.83) Multiple Sequence Alignments

Sequence format is Pearson
Sequence 1: SEQ_178 348 aa
Sequence 2: B_weihenstephanensis 336 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 46
Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15141625.dnd]
Start of Multiple Alignment
There are 1 groups
Aligning...
Group 1: Sequences: 2 Score: 5407
Alignment Score 945
CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15141625.aln]



SEQ_178               MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60
B_weihenstephanensis  -MSDIHKKIKKKQFAPLYVLYGIEAYFINETIKLITTEALEEEDREFNVVTYDLEEAYLE 59



SEQ_178               AALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAPY 120
B_weihenstephanensis  DVVEDAHTLPFFGERKVILIKSPLFLTAQKEK-LEQMIKILEEYIGEPSPFSILVFVAPY 118



SEQ_178               EKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGTQ 180
B_weihenstephanensis  EKLDERKKITKLLKKTADVIEANAMQVQDVQKWIV8RADEVHVHID1SIAAVSLLLELVGSN 178



SEQ_178               LSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYDL 240
B_weihenstephanensis  VTMLAKEMDKLTLYVGMGGEITPKLVTELVPKSVEQNVFALTEKVVKKDIAGAMQILDGL 238



SEQ_178               LENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFAD 300
B_weihenstephanensis  FTQQEEPIKLLALLVSQFRLLHQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKFFSF 298



SEQ_178               GELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348
B_weihenstephanensis  EELKKVIIELAEADYSMKTGKMDKKLVLEFFLMRLNHM 336



CLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence format is Pearson 
Sequence 1: SEQ_178 348 aa 

Sequence 2: B_subtilis 347 aa 

Start of Pairwise alignments 
Aligning...

Sequences (1:2) Aligned. Score: 43 

Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15180145.dnd]
Start of Multiple Alignment 
There are 1 groups 
Aligning...

Group 1: Sequences: 2 Score: 5401 

Alignment Score 905 

CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15180145.aln]



SEQ_178     MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPIE 60
B_subtilis  MVFDVWKSLKKGEVHPVYCLYGKETYLLQETVSRIRQTWDQETKDFNLSVFDLEEDPLD 60

* . * * . . * *-„****.* . * * * * * * * * - * 

SEQ_178     AALEEAETVPFFGERRVILIKHPYFFTSEKEKE-IEHDLAKLEAYLKAPSPFSIVVFFAP 119
B_subtilis  QAIADAETFPFMGERRLVIVKNPYFLTGEKKKEKIEHNVSALESYIQSPAPYTVFVLLAP 120

* - . * * * ** r ****.... * * * + * 

SEQ_178     YEKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGT 179
B_subtilis  YEKLDERKKLTKALKKHAFMMEAKELNAKETTDFTVNLAKTEQKTIGTEAAEHLVLLVNG 180

*********.** * . , » * * * : :;: . * * : * : 

SEQ_178     QLSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYD 239
B_subtilis  HLSSIFQEIQKLCTFIGDREEITLDDVKMLVARSLEQNIFELINKIVNRKRTESLQIFYD 240

SEQ_178     LLENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFA 299
B_subtilis  LLKQNEEPIKIMALISNQFRLILQTKYFAEQGYGQKQIASKLKVHPFRVKLAMDQARLFS 300

•**..******^.**., , * * * - * * * . * **** ***. ************ ** *- 

SEQ_178     DGELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348
B_subtilis  EEELRLIIEQLAVMDYEMKTGKKDKQLLLELFLLQLLKRNEKNDPHY-- 347



CLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence format is Pearson 
Sequence 1: SEQ_178 348 aa 

Sequence 2: B_licheniformis 354 aa 

Start of Pairwise alignments 
Aligning...

Sequences (1:2) Aligned. Score: 44 

Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15232593.dnd]
Start of Multiple Alignment 
There are 1 groups 
Aligning...

Group 1: Sequences: 2 Score: 5440 

Alignment Score 922 

CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15232593.aln]



SEQ_178          --------MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVY 52
B_licheniformis  MKEGKVQMTVFDVWKSLKKGDIHPVYCLYGKETHLLQETVQRIRQAWDEETKDFNFSIF 60

* * * * - . * * ******** ******.*** ********* 

■» r * j . • »**«» .*.■*»* 

SEQ_178          DCEETPIEAALEEAETVPFFGERRVILIKHPYFFTSEKEKE-IEHDLAKLEAYLKAPSPF 111
B_licheniformis  DLEEDALDLAVEDAETFPFMGERRLVVAKNPAFLTAEKKKDKLEHNLGVLEAYITQPAPY 120

* * * .. *********-******- * . * * • ******- - * * * * * * * * . * . * . 

« • ■ . . . - . » . . - * * • " ■ ■ 

SEQ_178          SIVVFFAPYEKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAID 171
B_licheniformis  SVFVLLAPYEKLDERKKLTKLLKKHAHMVEAKELTAKETADFIAGLAKSEGKQIEPEAAS 180

* . *..***********. * . . . . * * * * * - ■ . * • * * * * - 
■ ■ * * » ..... * •• •• ■ ▼ 

SEQ_178          VLLRRAGTQLSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIP 231
B_licheniformis  ELVLLCHASLSAIAQEVKKLSTYTGORGEITIEDVRKLVARGLEQNIFELINKVVNRKRT 240

* . .***,.*.*.****•* * * * - -A- . * * . , . * . * 

» ... - -* ....... 

SEQ_178          AALQTFYDLLENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLA 291
B_licheniformis  EALQIFYDLLKQNEEPIKMMALIANQFRLLMQTKYFSDQGYGQKQIASNLKVHPFRVKLA 300

* * -Jr *****-^..**.* . * * * * * * . . . * * * * * * * . *********** 

T . .... ..... . 

SEQ_178          LAQAARFADGELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348
B_licheniformis  IEQARLFSEQELRNIIEQLAVMDYEMKTGKKDKQLLLELFLLKLLQPHEKNGLL 354



CLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence format is Pearson 
Sequence 1: SEQ_178 348 aa 

Sequence 2: B_halodurans 342 aa 

Start of Pairwise alignments 
Aligning...

Sequences (1:2) Aligned. Score: 41 

Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15293846.dnd]
Start of Multiple Alignment 
There are 1 groups 
Aligning...

Group 1: Sequences: 2 Score: 5263 

Alignment Score 856 

CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15293846.aln]



SEQ_178       -MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPI 59
B_halodurans  MNYLKLKQDVQAGRVAPVYFMYGTELFLMEDLIQDILSVTLSDEERDMNVSSYSLTDVPI 60

* . ... * kr * . * - - w -k -kr -k * -k . . rk -r k -k 

SEQ_178       EAALEEAETVPFFGERRVILIKHPYFFTSEKEKEIEHDLAKLEAYLKAPSPFSIVVFFAP 119
B_halodurans  EAALEEAETVPFFGSKRVVILKDAALFTSQK-LDVEKDVKRLEQYILNPVPETVLLIMAP 119

*-fr-k-k-krk-k-k*-k-k-k-kk • . * * * - * * * -k -k 



SEQ_178       YEKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAGT 179
B_halodurans  YEKLDERKKITKLIKKESLVLEAKPLDEGEAKAWLSSLASELQVEMDEKAIETLLGMTGL 179



SEQ_178       QLSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFYD 239
B_halodurans  RLTQLASEMNKLALYVGEGGIIRSEDVTLLVAKTLDQNIFDLIDFAINQRTHQALSLYHE 239



SEQ_178       LLENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARFA 299
B_halodurans  LLKQKEEPLKLLALLTRQFRIMYQVKELGRRGYTPMQMAKPLKIHPYVAKLAGKKAASMS 299



SEQ_178       DGELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348
B_halodurans  DQLLYSLIEKAADTEFAIKSGKVDKVLALELFLLTMGSRESVG 342



CLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence format is Pearson 
Sequence 1: SEQ_178 348 aa 

Sequence 2: B_clausii 340 aa 

Start of Pairwise alignments 
Aligning...

Sequences (1:2) Aligned. Score: 37 

Guide tree file created: [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15331074.dnd]
Start of Multiple Alignment 
There are 1 groups 
Aligning...

Group 1: Sequences: 2 Score: 4996 

Alignment Score 720 

CLUSTAL-Alignment file created [/ebi/extserv/clustalw-work/interactive/clustalw-20060823-15331074.aln]



SEQ_178    -MLERVWGNIEKRRFSPLYLLYGNEPFLLTETYERLVNAALGPEEREWNLAVYDCEETPI 59
B_clausii  MSIITEMKQLSSETMAPVYFLYGTERYLIEQFLQKASEALFAGEEGEMNDIRLRLQDTPL 60

aa-r^a*-**^**-** - - * a . -k ~k k -k 

j. - - - - ' » * ' * ' 

SEQ_178    EAALEEAETVPFFGERRVILIKHPYFFTSEKEK-EIEHDLAKLEAYLKAPSPFSIVVFFA 118
B_clausii  GEWEEAETVPFFSKRKLVVIQDFYLATSQKVATKLEHDIPALEAYLDSPAGETVLWVA 120

. -k*-k-k-k*-kkr-k * » -k -k . * * * * * k a T*r a a a a * * 

SEQ_178    PYEKLDERKKITKLAKEQSEVVIAAPLAEAELRAWVRRRIESQGAQASDEAIDVLLRRAG 178
B_clausii  PYEKLDERKKITKKLKKQTSAIDVNPFDEKETFAWMEAIAKQEGVAIEQDAKQLLYKRTG 180

**-k-k~k>k-k-k-k-k-k-k-k -k ■ * * a * * * * * * • » • * - * k * - * - * a * 



SEQ_178    TQLSALANEIDKLALFAGSGGTIEAAAVERLVARTPEENVFVLVEQVAKRDIPAALQTFY 238
B_clausii  PNLMLLHRELEKCMLYSKDGDPVTASIVDELVAETVEQSVFTVIDYTAKGRAGDAVRLYH 240



k ■, - -k -k * • * - * - * . -k -k -k * * a * * a a * * ^ 

, P - . ■. » a » » » » * • 



SEQ_178    DLLENNEEPIKILALLAAHFRLLSQVKWLASLGYGQAQIAAALKVHPFRVKLALAQAARF 298
B_clausii  QLLRQKEEPLAILAELTRQFRQFFQVKTRLAKGYTQKEMASQLKLHPYVVKLAJMIQQVKAL 300

- * * . » ^ * * . * * ^ * a « * * - * * * * * + * ■ • * • * * * * ». * 

, » * * • * a * * ... a a «a, 

SEQ_178    ADGELAEAINELADADYEVKSGAVDRRLAVELLLMRWGARPAQAGRHGRR 348
B_clausii  TTKQLRQALIVCEETDYAIKTGQADKELAVELLLLRLAST 340

* a ~k • -k * * » * -* * -jfc * * k » -k&kkr-k-kk~-k a 



